Recombinant dengue virus antigens comprising the C15 signal peptide, full-length PRM protein, and full-length E protein

ABSTRACT

The invention provides polynucleotides and polypeptides encoded therefrom having advantageous properties, including an ability to induce an immune response to flaviviruses. The polypeptides and polynucleotides of the invention are useful in methods of inducing immune response against flaviviruses, including dengue viruses. Compositions and methods for utilizing polynucleotides and polypeptides of the invention are also provided.

CROSS REFERENCE TO RELATED APPLICATION

This application claims priority to and benefit of provisional U.S. Patent Application Ser. No. 60/360,030, filed Feb. 26, 2002, the disclosure of which is incorporated herein by reference in its entirety for all purposes.

STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH AND DEVELOPMENT

This invention was developed in part with Government support by a grant from the Defense Advanced Research Projects Agency (DARPA) (Grant No. N65236-99-1-5421). The Government may have certain rights in this invention.

COPYRIGHT NOTIFICATION

Pursuant to 37 C.F.R. 1.71(e), Applicants note that a portion of this disclosure contains material which is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or patent disclosure, as it appears in the Patent and Trademark Office patent file or records, but otherwise reserves all copyright rights whatsoever.

FIELD OF THE INVENTION

This invention pertains generally to polypeptides that induce an immune response against one or more dengue viruses and/or other flaviviruses, polynucleotides encoding such polypeptides, methods of making and using such polypeptides, polynucleotides, and diagnostic assays employing such polypeptide and polynucleotides.

BACKGROUND OF THE INVENTION

Viruses of the Flavivirus genus are positive-sense, single-stranded RNA viruses of the Flaviviridae family, many of which are responsible for disease in humans and other mammals. Examples of flaviviruses include Tick-borne encephalitis virus, Japanese encephalitis virus, Yellow Fever virus, St. Louis encephalitis virus, hepatitis C virus, and West Nile viruses. Flaviviruses of the Flavivirus genus generally comprise three major mature structural proteins: an envelope (E) protein, a capsid (C) protein, and a membrane (M) protein. The M protein is usually formed as a proteolytic fragment of a pre-membrane (prM) protein. See, e.g., FIELDS VIROLOGY 997-998, Raven Press, Ltd., New York (D. M. Knipe et al. eds., 4th ed., 2001), 996 (hereinafter “FIELDS VIROLOGY”), and the ENCYCLOPEDIA OF VIROLOGY (R. G. Webster et al. eds., Academic Press, 2nd ed., 1999), each of which is incorporated herein by reference in its entirety for all purposes.

Dengue (DEN) viruses are known among flaviviruses as agents of disease in humans. Dengue viruses comprise four known distinct, but antigenically related serotypes, named Dengue-1 (DEN-1 or Den-1), Dengue-2 (DEN-2 or Den-2), Dengue-3 (DEN-3 or Den-3), and Dengue-4 (DEN-4 or Den-4). Dengue virus particles are typically spherical and include a dense core surrounded by a lipid bilayer. FIELDS VIROLOGY, supra.

The genome of a dengue virus, like other flaviviruses, typically comprises a single-stranded positive RNA polynucleotide. FIELDS VIROLOGY, supra, at 997. The genomic RNA serves as the messenger RNA for translation of one long open reading frame (ORF) as a large polyprotein, which is processed co-translationally and post-translationally by cellular proteases and a virally encoded protease into a number of protein products. Id. Such products include structural proteins and non-structural proteins. A portion of the N-terminal of the genome encodes the structural proteins—the C protein, prM (pre-membrane) protein, and E protein—in the following order: C-prM-E. Id. at 998. The C-terminus of the C protein includes a hydrophobic domain that functions as a signal sequence for translocation of the prM protein into the lumen of the endoplasmic reticulum. Id. at 998-999. The prM protein is subsequently cleaved to form the structural M protein, a small structural protein derived from the C-terminal portion of prM, and the predominantly hydrophilic N-terminal “pr” segment, which is secreted into the extracellular medium. Id. at 999. The E protein is a membrane protein, the C-terminal portion of which includes transmembrane domains that anchor the E protein to the cell membrane and act as signal sequence for translocation of non-structural proteins. Id. The E protein is the major surface protein of the virus particle and is believed to be the most immunogenic component of the viral particle. The E protein likely interacts with viral receptors, and antibodies that neutralize infectivity of the virus usually recognize the E protein. Id. at 996. The M and E proteins have C-terminal membrane spanning segments that serve to anchor these proteins to the membrane. Id. at 998.

Dengue viruses are primarily transmitted to humans through the mosquito Aedes aegypti. There is a significant threat of dengue infection to people living in or visiting tropical areas. Indeed, an estimated 2.5 billion people live in areas at risk for transmission and over 100 million humans are infected each year. Infection with dengue virus is estimated to kill approximately 20-25,000 children each year.

An initial dengue virus infection is clinically manifested for most of the cases by dengue fever (DF), which is a self-limited fibril illness. Although rarely fatal, DF is characterized by often-severe disseminated body pain, headache, fever, rash, lymphadenopathy and leukopenia. Subsequent infection with a heterologous Dengue virus can lead to the much more severe to fatal disease of dengue hemorrhagic fever (DHF) or dengue shock syndrome (DSS). It is hypothesized that the presence of antibodies to the serotype causing the primary infection enhances the infection by a heterologous serotype in secondary infections. This phenomenon is referred to as antibody-dependent enhancement (ADE) of the disease.

Effective diagnosis of dengue virus is often problematic. All four dengue virus serotypes can be prevalent in one local area and it is therefore important to test a subject's serum samples simultaneously against all 4 dengue virus serotypes.

There is currently no specific treatment for dengue virus infections. Although the development of dengue virus vaccines has been ongoing for the past 50 years, no successful vaccine to dengue virus has been produced and no licensed dengue virus vaccine is yet available. A major challenge is to generate a tetravalent vaccine that induces neutralizing antibodies against all four strains of dengue to avoid ADE when the individual encounters viruses of two or more different serotypes. Vaccine strategies using a mixture of DEN 1-4 attenuated viruses have been largely unsuccessful, because the antigens from one type will tend to dominate or “mask” the others, producing an incomplete immune response across the four types.

There remains a need for molecules, compositions and methods for effectively diagnosing one or more dengue viruses, inducing, enhancing, or promoting an immune response to flaviviruses, particularly dengue viruses, and preferably to all four dengue virus serotypes, and prophylactically or therapeutically treating disorders or diseases related to one or more such viruses. The present invention provides such molecules, methods and compositions. These and other advantages of the invention, as well as additional inventive features, will be apparent from the description of the invention provided herein.

SUMMARY OF THE INVENTION

The invention provides novel recombinant, synthetic, mutant, and/or isolated polypeptides as described herein, fusion proteins comprising such polypeptides, and nucleic acids encoding such polypeptides and/or proteins, that are useful in promoting (e.g., inducing and/or enhancing) an immune response to one or more flaviviruses, particularly one or more dengue viruses, and detecting or diagnosing the presence of anti-Flaviviridae virus antibodies (e.g., anti-flavivirus antibodies) against at least one virus of the Flaviviridae family (e.g., members of the Flavivirus genus), and/or anti-dengue virus antibodies against at least one dengue virus in a biological sample.

In one aspect, for example, the invention provides recombinant, synthetic, mutant, and/or isolated polypeptides that each comprise an amino acid sequence that has at least about 80%, 83%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% amino acid sequence identity to an amino acid sequence comprising a truncated recombinant, synthetic or mutant dengue virus envelope (E) protein polypeptide of the invention, such as at least one of any of SEQ ID NOS:1-49 and 153-155. Some such polypeptides induce an immune response in a subject, e.g., mammal or population of mammalian cells. against at least one dengue virus, or dengue virus antigen, of at least one serotype selected from the group of dengue-1, dengue-2, dengue-3, and dengue-4 in vitro or ex vivo in cells and/or in vivo in a subject or cells or tissue thereof. Such a recombinant dengue E protein polypeptide is “truncated,” since by comparison with the full length sequence of a wild-type (WT) dengue virus E protein, it lacks one or more amino acid residues from the C terminal of the full length E protein sequence. Usually, a truncated E protein lacks from about 3%, 5%, 10%, 15%, or 20% to about 25% of the C terminal amino acid residues of the full length E protein. Each such polypeptide comprises an immunogenic or antigenic amino acid sequence that is capable of inducing an immune response against one or more dengue viruses, or virus-like particle (VLP) or antigen thereof, of one or more, preferably multiple (e.g., 2, 3, or 4) serotypes. Some such polypeptides induce an immune response to all 4 dengue virus serotypes (a tetravalent immune response) or antigens, when expressed in, or delivered to, an animal or animal cell. Particular polypeptides having such characteristics advantageously are capable of inducing neutralizing antibodies against dengue viruses of multiple dengue virus serotypes upon expression in, or delivery to, an animal or animal cell(s), and, preferably, are able to induce a protective immune response against at least one of the four dengue virus serotypes in a subject, e.g., mammal, such as a primate or a human. Preferably, the recombinant polypeptide induces a protective immune response against all four dengue virus serotypes in a subject, e.g., mammal.

Such truncated E protein polypeptides of the invention induce an immune response in a subject against at least one dengue virus, or VLP or antigen thereof, of each of at least one, two, three, or four serotypes selected from the group of dengue-1, dengue-2, dengue-3, and dengue-4 that is about equal to or greater than an immune response induced in the subject against the at least one dengue virus of each of the at least one, two, three or four serotypes, or VLP or antigen thereof, by a wild-type truncated E protein of each said at least one dengue virus of each of the one, two, three or four serotypes, respectively, wherein said wild-type truncated E protein has an amino acid sequence length substantially equivalent to that of the recombinant, synthetic, or mutant polypeptide of the invention.

Each such recombinant, synthetic, or mutant polypeptides induces production of one or more types of antibodies that bind to at least one dengue virus of each of at least one, two, three or four serotypes. In one aspect of the invention, some such polypeptides induce production of a number or population of antibodies that bind to at least one dengue virus of each of at least one, two, three or four serotypes that is about equal to or greater than the number induced by a wild-type truncated E protein of the at least one dengue virus of each of the at least one, two, three or four serotypes, respectively.

Each such recombinant, synthetic, or mutant polypeptide of the invention induces or produces a titer of neutralizing antibodies against at least one dengue virus of each of at least one, two, three or four serotypes. Further, some such polypeptides induce or produce a titer of neutralizing antibodies against at least one dengue virus of each of at least one, two, three, or four serotypes that is about equal to or greater than a titer of neutralizing antibodies induced or produced against the at least one dengue virus of each of the at least one, two, three, or four serotypes by a wild-type truncated E protein of the at least one dengue virus of each of the at least one, two, three or four serotypes, respectively, wherein each said wild-type truncated E protein is selected from the group of SEQ ID NOS:338-341.

Some such recombinant, synthetic, or mutant polypeptides further comprise: (a) an amino acid sequence of at least about 150 amino acid residues that has at least about 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more amino acid sequence identity to at least one of SEQ ID NOS:117-126 fused to the N-terminus of the amino acid sequence of the recombinant or synthetic polypeptide; (b) an amino acid sequence of at least about 40 amino acid residues that has at least about 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more amino acid sequence identity to at least one of SEQ ID NOS:127-136 fused to the C-terminus of the amino acid sequence of the recombinant or synthetic polypeptide; or (c) the amino acid sequence of (a) and the amino acid sequence of (b).

The invention also provides a recombinant, synthetic, or mutant truncated or non-truncated dengue virus envelope protein, wherein the recombinant or synthetic polypeptide is encoded a nucleic acid comprising a polynucleotide sequence selected from the group of: (a) a polynucleotide sequence having at least about 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more nucleotide sequence identity to at least one polynucleotide sequence selected from the group of SEQ ID NOS:285-330 or a complementary polynucleotide sequence thereof; (b) a RNA polynucleotide sequence comprising a DNA sequence selected from the group of SEQ ID NOS:285-330 in which all of the thymine nucleotide residues in the DNA sequence are replaced with uracil nucleotide residues or a complementary RNA polynucleotide sequence thereof; (c) a RNA polynucleotide sequence that has at least about 70, 75, 80, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99% nucleotide sequence identity to at least one RNA polynucleotide sequence of (b) or a complementary RNA polynucleotide sequence thereof; (d) a polynucleotide sequence that hybridizes under at least stringent conditions over substantially the entire length of a polynucleotide sequence of (a)-(c); (e) a polynucleotide sequence which would hybridize under at least stringent conditions over substantially the entire length of a polynucleotide sequence of any of (a)-(d) but for the degeneracy of the genetic code; and (f) a polynucleotide sequence that possesses any combination of the features of the polynucleotide sequences of (a)-(e); or a complementary sequence of any thereof.

In another aspect, the invention provides isolated, recombinant, synthetic, or mutant polypeptides that each comprise an amino acid sequence that has at least about 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% or more amino acid sequence identity to the amino acid sequence of at least one of SEQ ID NOS:65-116. Each such polypeptide typically comprises a recombinant, synthetic, or mutant dengue virus antigen comprising a PRM15/truncated E polypeptide. Such PRM15/truncated E protein polypeptide comprises a first amino acid sequence comprising a methionine at the N terminus fused to a second recombinant, mutant, or synthetic amino acid sequence comprising about 15 amino acid residues that may correspond to about the last 15 amino acids of the C terminal of a recombinant, synthetic or mutant dengue virus prM protein. The C terminus of the second amino acid sequence is fused in turn to the N terminus of a 3rd sequence which comprises a recombinant, mutant or synthetic truncated dengue virus E protein. The truncated E protein is deemed “truncated,” since by comparison with a WT full length dengue virus E protein, it lacks one or more amino acids at the C terminus of the E protein. A truncated E protein may comprise about 75, 80, 85 or 90% E protein, said 75, 80, 85, or 90% E protein representing a portion of the E protein that comprises about 75, 80, 85, or 90%, respectively, of its length starting from amino acid 1 at its N-terminus. Such recombinant or synthetic PRM15/truncated E polypeptide induces an immune response against at least one dengue virus of at least one serotype, or VLP or antigen thereof.

Some such PRM15/truncated E polypeptides induce an immune response in a subject against at least one dengue virus of one serotype that is about equal to or greater than the immune response induced in the mammal against the same dengue virus of the same serotype by a PRM15/truncated E protein derived from the dengue virus serotype. Some such PRM15/truncated E polypeptides induce an immune response in a subject against each of at least 2 dengue viruses of 2 different serotypes that is about equal to or greater than the immune response induced in the subject against each of these 2 dengue viruses of different serotypes by a PRM15/truncated E protein made from either of the two serotypes. Some such PRM15/truncated E polypeptides each induce an immune response against each of at least three dengue viruses of three different serotypes that is about equal to or greater than the immune response induced in such cell against these three dengue viruses of three different serotypes by a PRM15/truncated E protein derived from any of the three serotypes. Some such PRM15/truncated E polypeptides induce an immune response in a subject against each of at least 4 dengue viruses of four different serotypes that is about equal to or greater than the immune response induced in such cell against these 4 dengue viruses of four different serotypes by a PRM15/truncated E protein derived from any of the four serotypes. In one aspect, a WTPRM15/truncated E protein polypeptide is selected from SEQ ID NOS:149-152.

PRM15/truncated E polypeptides are capable of inducing production of a population of antibodies comprising one or more types of antibodies that bind to at least one dengue virus of each of the at least 1, 2, 3 or 4 serotypes. Some such polypeptides induce the production of a population or number of antibodies that bind to at least one dengue virus of each of at least one, two, three or four serotypes that is about equal to or greater than is induced by the wild-type PRM15/truncated E protein polypeptide of each of the at least one, two, three, or four serotypes.

Furthermore, recombinant, synthetic, or mutant PRM15/truncated E polypeptides induce the production of a population of antibodies that bind more specifically to at least one dengue virus of each of at least one, two, three or four serotypes than is induced by the wild-type PRM15/truncated E polypeptide of the at least one dengue virus of each of the at least one, two, three, or four serotypes, respectively, wherein each said wild-type PRM15/truncated E polypeptide is selected from SEQ ID NOS:149-152.

In one aspect, recombinant, synthetic, or mutant PRM15/truncated E polypeptides induce the production of a titer of neutralizing antibodies against at least one dengue virus of each of at least one, two, three, or four serotypes. In a particular aspect, some of these recombinant, synthetic, or mutant PRM15/truncated E polypeptides induce the production of a titer of neutralizing antibodies against at least one dengue virus of each of at least one, two, three, or four serotypes that is about equal to or greater than a titer of neutralizing antibodies induced in the subject against the at least one dengue virus of each of at least one, two, three, or four serotypes, respectively, by a corresponding wild-type PRM15/truncated E polypeptide (“WT PRM15/truncated E fusion protein”) of the at least one dengue virus of each of the at least one, two, three or four serotypes, wherein each said WT PRM15/truncated E protein polypeptide is selected from SEQ ID NOS: 149-152. The sequence of SEQ ID NO:149, for example, comprises the following amino acid sequence: a methionine as the first amino acid residue, the last 15 amino acids from the C terminus of the prM sequence of WT DEN-1, and a truncated amino acid sequence of DEN-1 envelope protein, which is termed “truncated” because it excludes a number of amino acid residues from the C terminus of the envelope protein of DEN-1. For example, in one embodiment, about 11-14% (and preferably about 13%) of the amino acid residues of the C terminus of the E protein of Den-1 were excluded.

The invention also provides recombinant, synthetic, or mutant PRM15/truncated E polypeptides, each encoded by a nucleic acid comprising a polynucleotide sequence selected from the group of: (a) a polynucleotide sequence having at least about 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more nucleic acid sequence identity to at least one polynucleotide sequence selected from the group of SEQ ID NOS:156-200, 235, 342, and 344, or a complementary polynucleotide sequence thereof; (b) a RNA polynucleotide sequence comprising a DNA sequence selected from the group of SEQ ID NOS:156-200, 235, 342, and 344 in which all of the thymine nucleotide residues in the DNA sequence are replaced with uracil nucleotide residues or a complementary RNA polynucleotide sequence thereof; (c) a RNA polynucleotide sequence that has at least about 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more nucleic acid sequence identity to at least one RNA polynucleotide sequence of (b) or a complementary RNA polynucleotide sequence thereof; (d) a polynucleotide sequence that hybridizes under at least stringent conditions over substantially the entire length of a polynucleotide sequence of (a)-(c); (e) a polynucleotide sequence which would hybridize under at least stringent conditions over substantially the entire length of a polynucleotide sequence of any of (a)-(d) but for the degeneracy of the genetic code; and (f) a polynucleotide sequence that possesses any combination of the features of the polynucleotide sequences of (a)-(e).

In another aspect, the invention provides recombinant, synthetic, mutant, and/or isolated polypeptides, each of which comprises an amino acid sequence that has at least about 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more amino acid sequence identity to an amino acid sequence of at least one of SEQ ID NOS:139-148, 236-253, 343, and 345. Each such polypeptide typically comprises a recombinant, synthetic, or mutant dengue virus antigen comprising a fusion protein comprising a C15/full length prM protein/full length E protein. C15/full length prM/full length E protein polypeptides induce an immune response in a subject against at least one dengue virus of each of at least one, two, three, or four dengue virus serotypes. Further, for some such polypeptides, the immune response induced against at least one dengue virus of each of at least 1, 2, 3 or 4 serotypes is about equal to or greater than an immune response induced in such cell against the at least one dengue virus of each of the at least 1, 2, 3 or 4 serotypes by a wild-type C15/full length prM/full length E fusion protein polypeptide of each of the at least one, two, three or four serotypes, respectively, wherein the corresponding wild-type C15/full length prM/full length E fusion protein is selected from SEQ ID NOS:227-230.

For C15/full length prM protein/full length E protein polypeptides, the immune response may comprise the production of antibodies that bind to at least one dengue virus of each of at least one, two, three or four serotypes. In addition, some such polypeptides may induce production of a number of antibodies that bind to at least one dengue virus of each of the at least one, two, three, or four serotypes that is about equal to or greater than that induced by a corresponding wild-type C15/full length prM/full length E fusion protein of each of the at least one, two, three, or four serotypes, respectively, wherein each wild-type C15/full length prM/full length E fusion protein is selected from SEQ ID NOS:227-230.

In another aspect, some such C15/full length prM/full length E fusion protein polypeptides of the invention induce or produce a titer of neutralizing antibodies against at least one dengue virus of each of at least one, two, three, or four dengue virus serotypes. Furthermore, some such polypeptides induce or produce a titer of neutralizing antibodies in a subject against at least one dengue virus of each of at least one, two, three, or four serotypes that is about equal to or greater than a titer of neutralizing antibodies induced or produced in the subject against the at least one dengue virus of each of at least one, two, three, or four serotypes by a wild-type C15/full length prM/full length E fusion protein polypeptide of the at least one dengue virus of each of the at least one, two, three, or four serotypes, respectively, wherein each said wild-type C15/full length prM/full length E fusion protein polypeptide is selected from SEQ ID NOS:227-230.

The invention also provides recombinant, synthetic, or mutant C15/full length prM/full length E fusion protein polypeptides, wherein each such polypeptide is encoded by a nucleic acid comprising a polynucleotide sequence selected from the group of: (a) a polynucleotide sequence having at least about 80%, 85%, 90%, 93%, 95%, 98% or more nucleic acid sequence identity to at least one polynucleotide sequence selected from the group of SEQ ID NOS:201-210, 254-271, 342, and 344, or a complementary polynucleotide sequence thereof; (b) a RNA polynucleotide sequence comprising a DNA sequence selected from the group of SEQ ID NOS:201-210, 254-271, 342, and 344 in which all of the thymine nucleotide residues in the DNA sequence are replaced with uracil nucleotide residues or a complementary RNA polynucleotide sequence thereof; (c) a RNA polynucleotide sequence that has at least about 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more nucleic acid sequence identity to at least one RNA polynucleotide sequence of (b) or a complementary RNA polynucleotide sequence thereof; (d) a polynucleotide sequence that hybridizes under at least stringent conditions over substantially the entire length of a polynucleotide sequence of (a)-(c); (e) a polynucleotide sequence which would hybridize under at least stringent conditions over substantially the entire length of a polynucleotide sequence of any of (a)-(d) but for the degeneracy of the genetic code; and (f) a polynucleotide sequence that possesses any combination of the features of the polynucleotide sequences of (a)-(e).

In another aspect, the invention provides a composition comprising at least one recombinant, mutant, synthetic and/or isolated polypeptide or nucleic acid of the invention and an excipient or carrier, including a pharmaceutically acceptable carrier or excipient.

In one aspect, the invention provides a composition comprising at least one recombinant, mutant, synthetic and/or isolated polypeptide comprising an amino acid sequence selected from the group of SEQ ID NOS:1-49, 65-116, 139-148, 153-155, 236-253, 343, and 345, or an antigenic or immunogenic polypeptide fragment thereof, that induces an immune response in a subject against at least one dengue virus of at least one virus serotype that is about equal to or greater than the immune response induced by a antigenic or immunogenic polypeptide fragment of the at least one dengue virus of the at least one serotype; and an excipient or carrier.

In yet another aspect, the invention provides an isolated, recombinant, mutant, or synthetic nucleic acid comprising a polynucleotide sequence selected from the group of: (a) a polynucleotide sequence having at least about 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more nucleic acid sequence identity to a sequence selected from the group of SEQ ID NOS:211-218 or a complementary polynucleotide sequence thereof; (b) a RNA polynucleotide sequence having at least about 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more nucleic acid sequence identity to a DNA sequence selected from the group of SEQ ID NOS:211-218 in which all of the thymine nucleotide residues in the DNA sequence are replaced with uracil nucleotide residues or a complementary RNA polynucleotide sequence thereof; and (c) a polynucleotide sequence that hybridizes under at least stringent conditions over substantially the entire length of a polynucleotide sequence of (a) or (b), or a complementary sequence thereof.

In another aspect, the invention includes an isolated, recombinant, mutant, or synthetic nucleic acid comprising a polynucleotide sequence selected from the group of: (a) a polynucleotide sequence comprising a nucleotide sequence having at least about 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more nucleic acid sequence identity to a sequence selected from the group of SEQ ID NOS:285-330, or a complementary polynucleotide sequence thereof; (b) a polynucleotide sequence encoding a polypeptide selected from SEQ ID NOS:1-49 and 153-155, or a complementary polynucleotide sequence thereof; (c) a RNA polynucleotide sequence having at least about 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more sequence identity to a DNA sequence selected from the group of SEQ ID NOS:285-330 in which all of the thymine nucleotide residues in the DNA sequence are replaced with uracil nucleotide residues, or a complementary RNA polynucleotide sequence thereof; (d) a polynucleotide sequence that hybridizes under at least stringent conditions over substantially the entire length of a polynucleotide sequence of (a)-(c); (e) a polynucleotide sequence which would hybridize under at least stringent conditions over substantially the entire length of a polynucleotide sequence of any of (a)-(d) but for the degeneracy of the genetic code; (f) a polynucleotide sequence having at least about 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more sequence identity to at least one polynucleotide sequence selected from the group of SEQ ID NOS:285-330, or a complementary polynucleotide sequence thereof, wherein said polynucleotide sequence encodes a polypeptide that induces an immune response in a subject against at least one dengue virus of at least one serotype selected from dengue-1, dengue-2, dengue-3, and dengue-4 that is about equal to or greater than an immune response induced in the subject against the at least one dengue virus of the at least one serotype by a wild-type truncated envelope (E) protein of at least one dengue virus of the at least one serotype, wherein said wild-type truncated E protein is selected from the group of SEQ ID NOS:338-341; and (g) a polynucleotide sequence that possesses any combination of the features of the polynucleotide sequences of (a)-(f). The invention also provides a composition comprising an excipient or carrier and at least one nucleic acid of the invention, including, e.g., at least one polynucleotide sequence as defined by any of (a)-(g) above.

Also provided are isolated, recombinant, mutant, and/or synthetic nucleic acids that each comprise a polynucleotide sequence selected from the group of: (a) a polynucleotide sequence having at least about 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more sequence identity to a sequence selected from the group of SEQ ID NOS:156-200, 235, 342, and 344, or a complementary polynucleotide sequence thereof; (b) a polynucleotide sequence encoding a polypeptide selected from SEQ ID NOS:65-116, or a complementary polynucleotide sequence thereof; (c) an RNA polynucleotide sequence having at least about 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more sequence identity to a DNA sequence selected from the group of SEQ ID NOS:156-200, 235, 342, and 344, in which all of the thymine nucleotide residues in the DNA sequence are replaced with uracil nucleotide residues, or a complementary RNA polynucleotide sequence thereof; (d) a polynucleotide sequence that hybridizes under at least stringent conditions over substantially the entire length of a polynucleotide sequence of (a)-(c); (e) a polynucleotide sequence which would hybridize under at least stringent conditions over substantially the entire length of a polynucleotide sequence of any of (a)-(d) but for the degeneracy of the genetic code; (f) a polynucleotide sequence or fragment thereof having at least about 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more sequence identity to at least one polynucleotide sequence selected from the group of SEQ ID NOS:156-200, 235, 342, and 344, or a complementary polynucleotide sequence thereof, wherein said polynucleotide sequence or fragment thereof encodes a polypeptide that induces an immune response in a subject against at least one dengue virus of at least one serotype selected from DEN-1, DEN-2, DEN-3, and DEN-4 that is about equal to or greater than an immune response induced in the subject against the at least one dengue virus of the at least one serotype by a wild-type truncated envelope (E) protein of at least one dengue virus of the at least one serotype, wherein said wild-type truncated E protein is selected from any of SEQ ID NOS:149-152; and (g) a polynucleotide sequence that possesses any combination of features of the sequences of (a)-(f).

In another aspect, the invention provides isolated, recombinant, mutant, and/or synthetic nucleic acids that each comprise a polynucleotide sequence selected from the group of: (a) a polynucleotide sequence having at least about 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more sequence identity to a sequence selected from the group of SEQ ID NOS:201-210, 254-271, 342, and 344, or a complementary polynucleotide sequence thereof; (b) a polynucleotide sequence encoding a polypeptide selected from SEQ ID NOS:139-148, 236-253, 343, and 345 or a complementary polynucleotide sequence thereof; (c) a RNA polynucleotide sequence having at least about 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more sequence identity to a DNA sequence selected from the group of SEQ ID NOS:201-210, 254-271, 342, and 344 in which all of the thymine nucleotide residues in the DNA sequence are replaced with uracil nucleotide residues or a complementary RNA polynucleotide sequence thereof; (d) a polynucleotide sequence that hybridizes under at least stringent conditions over substantially the entire length of a polynucleotide sequence of (a)-(c); (e) a polynucleotide sequence which would hybridize under at least stringent conditions over substantially the entire length of a polynucleotide sequence of any of (a)-(e) but for the degeneracy of the genetic code; (f) a polynucleotide sequence, or fragment thereof, having at least about 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more sequence identity to at least one polynucleotide sequence selected from the group of SEQ ID NOS:201-210, 254-271, 342, and 344, or a complementary polynucleotide sequence thereof, wherein said polynucleotide sequence or fragment thereof encodes a polypeptide that induces an immune response in a subject against at least one dengue virus of at least one serotype selected from DEN-1, DEN-2, DEN-3, and DEN-4 that is about equal to or greater than an immune response induced in the subject against the at least one dengue virus of the at least one serotype by a WT truncated envelope (E) protein of at least one dengue virus of the at least one serotype, wherein said WT truncated E protein is selected from the group of SEQ ID NOS:227-230; and (g) a polynucleotide sequence that possesses any combination of the features of the polynucleotide sequences of (a)-(f).

Other advantageous features of the aforementioned polypeptides and polynucleotides encoding such polypeptide include the ability to induce a protective immune response against one or more dengue viruses, preferably against dengue viruses of multiple virus serotypes, in a subject, such as, e.g., an animal, including a mammal, or cell(s) thereof. Other desirable features of such polypeptides include higher expression, higher secretion, and/or more or more specific antibody binding exhibited by such polypeptides with respect to wild-type dengue virus proteins, including, e.g., C15/full length prM/full length E fusion proteins, full length prM/full length E fusion proteins, PRM15/truncated E protein polypeptides, PRM15/full length E fusion protein, full length E or truncated E proteins, and/or one or more fragments of any thereof, as described herein.

The invention further provides fusion proteins comprising the aforementioned polynucleotides of the invention; vectors comprising one or more of the polynucleotides of the invention; cells comprising such polypeptides, vectors, polynucleotides, and fusion proteins; and pharmaceutical compositions comprising such polypeptides, polynucleotides, vectors, fusion proteins, and/or cells. Exemplary vectors provided by the invention include viral vectors, including, e.g., flaviviral vectors (including, e.g., attenuated flaviviral vectors comprising a polynucleotide of the invention in place of at least a portion of the flaviviral vector genome encoding a wild-type C15/full length prM/full length E protein, full length prM/full length E fusion protein, PRM15/truncated E protein polypeptide, PRM15/full length E fusion protein, full length E or truncated E protein, and/or fragments of any thereof) and nucleotide vectors, such as the plasmid vector pMaxVax10.1 (described in Example 1 and shown in FIGS. 1 and 2), which the inventors have discovered to be an effective gene delivery vehicle in mammalian hosts.

With respect to polynucleotides, the invention further provides, for example, a polynucleotide comprising a nucleic acid sequence of at least about 1200 nucleotides that has at least about 65%, 75%, 80%, 85%, or 90% nucleotide sequence identity with at least one of SEQ ID NOS:285-330, as well as polynucleotides which hybridize with such a polynucleotide, and polynucleotides comprising a sequence which is the complement of the nucleic acid sequence. In another aspect, the invention provides a nucleic acid comprising a sequence of at least about 1200 nucleotides that has at least about 70%, 75%, 80%, or 90% nucleic sequence identity to at least one of SEQ ID NOS:211-214.

The invention also provides methods of promoting (inducing and/or enhancing) an immune response to a dengue virus in a subject (e.g., animal, such as a mammal) by administering polypeptides, fusion protein, polynucleotides, vectors, or cells of the invention as described herein. In some such methods, administration of an immunogenic or antigenic polypeptide of the invention (preferably, e.g., a polypeptide which induces a neutralizing antibody response against one or more dengue viruses of multiple virus serotypes) to a subject is followed by repeat administration at selected time periods, resulting in an improved immune response. Such “boosting” administration strategies also are advantageously performed in conjunction with the administration of a polynucleotide of the invention (e.g., prophylactic or therapeutic administration of an immunogen-encoding polynucleotide (e.g., DNA vaccine)) is preferably followed by subsequent administration of additional immunogen-encoding or antigen-encoding polynucleotide of the invention and/or an immunogenic or antigenic polypeptide of the invention).

In another aspect, the invention provides a composition comprising a library of at least two recombinant or synthetic nucleic acids obtained by a method comprising recombining at least a first nucleic acid comprising a sequence selected from SEQ ID NOS:211-214, and at least a second nucleic acid, wherein the first and second nucleic acids differ from each other in two or more nucleotides, to produce a library of recombinant or synthetic nucleic acids.

The invention also includes a composition comprising a library of nucleic acids obtained by a method comprising recombining at least a first nucleic acid comprising a sequence selected from the group of SEQ ID NOS:215-218, and at least a second nucleic acid, wherein the first and second nucleic acids differ from each other in two or more nucleotides, to produce a library of recombinant or synthetic nucleic acids.

In another aspect, the invention provides a polypeptide which is specifically bound by polyclonal antisera raised against at least one antigen, the at least one antigen comprising an amino acid sequence selected from the group of SEQ ID NOS:1-49 and 153-155, or an antigenic or immunogenic fragment thereof, wherein said antigenic or immunogenic polypeptide fragment thereof that induces an immune response in a subject against at least one dengue virus of at least one virus serotype that is about equal to or greater than the immune response induced in the subject by a antigenic or immunogenic polypeptide fragment of the at least one dengue virus of the at least one serotype, wherein the polyclonal antisera is subtracted with at least one of: a truncated envelope protein selected from the group of SEQ ID NOS:338-341 and a truncated envelope protein comprising a known wild-type truncated dengue virus protein sequence, or an amino acid sequence fragment of a polypeptide sequence corresponding to a known wild-type dengue virus E protein, wherein said amino acid sequence fragment has a length substantially identical (e.g., at least about 75%, 80%, 85%, 86%, 87%, 88% or 89%, preferably at least about 90%, 91%, 92%, 93%, or 94%, and more preferably at least about 95% (e.g., about 87-95%), 96% 97%, 98%, 99%, 99.5% sequence identity) to the truncated envelope protein of any of SEQ ID NOS:338-341.

The invention also provides polypeptides that are specifically bound by polyclonal antisera raised against at least one antigen, the at least one antigen comprising an amino acid sequence selected from the group of SEQ ID NOS:65-116, wherein the polyclonal antisera is subtracted with at least one of: a PRM15/truncated envelope protein selected from the group of SEQ ID NOS:149-152 and other amino acid sequences comprising known dengue virus PRM15/truncated envelope proteins.

The invention also provides polypeptides that are specifically bound by polyclonal antisera raised against at least one antigen, the at least one antigen comprising an amino acid sequence selected from the group of SEQ ID NOS:139-148, 236-253, 343, and 345, wherein the polyclonal antisera is subtracted with at least one of: a fusion protein comprising a C15/full length prM protein/full length E protein selected from the group of SEQ ID NOS:227-230 and other amino acid sequences comprising known dengue virus C15/full length prM protein/full length E protein PRM15/truncated envelope proteins.

The invention also includes an antibody or antisera produced by administering a truncated E polypeptide of the invention to a subject, which antibody or antisera specifically binds at least one antigen, the at least one antigen comprising a polypeptide comprising at least one amino acid sequence of SEQ ID NOS:1-49 and 153-155, which antibody or antisera does not specifically bind to at one or more of: the polypeptides of SEQ ID NOS:338-341 and a truncated envelope protein comprising a known wild-type truncated dengue virus protein sequence, or an amino acid sequence fragment of a polypeptide sequence corresponding to a known wild-type dengue virus E protein, wherein said amino acid sequence fragment has a length substantially identical (e.g., at least about 75%, 80%, 85%, 86%, 87%, 88% or 89%, preferably at least about 90%, 91%, 92%, 93%, or 94%, and more preferably at least about 95% (e.g., about 87-95%), 96% 97%, 98%, 99%, 99.5% sequence identity) to the truncated envelope protein of any of SEQ ID NOS:338-341.

In another aspect, the invention provides an antibody or antisera produced by administering a truncated E polypeptide of the invention to a subject, which antibody or antisera specifically binds at least one antigen, the at least one antigen comprising a polypeptide comprising at least one amino acid sequence of SEQ ID NOS:65-116, which antibody or antisera does not specifically bind to at one or more of: a PRM15/truncated envelope protein selected from the group of SEQ ID NOS:149-152 and other amino acid sequences comprising known dengue virus PRM15/truncated envelope proteins. In yet another aspect, the invention provides an antibody or antisera produced by administering a truncated E polypeptide of the invention to a subject, which antibody or antisera specifically binds at least one antigen, the at least one antigen comprising a polypeptide comprising at least sequence of SEQ ID NOS:139-148, 235-253, 343, and 345, which antibody or antisera does not specifically bind to at one or more of: a fusion protein comprising a C15/full length prM protein/full length E protein selected from the group of SEQ ID NOS:227-230 and other amino acid sequences comprising known dengue virus C15/full length prM protein/full length E protein PRM15/truncated envelope proteins.

The invention also includes a pharmaceutical composition comprising at least one polypeptide of the invention (or at least one polynucleotide of the invention), and a pharmaceutically acceptable diluent, carrier, or excipient, wherein the at least one polypeptide (or polynucleotide) is present in an amount effective to provide a subject with protective immunity to at least one dengue virus of at least 1, 2, 3, or 4 dengue serotypes.

The invention also includes a pharmaceutical composition comprising at least one polypeptide of the invention (or at least one polynucleotide of the invention), and a pharmaceutically acceptable diluent, carrier, or excipient, wherein the at least one polypeptide (or polynucleotide) is present in an amount effective to induce an immune response (e.g., specific immune response) to at least one dengue virus of at least one, two, three, or four dengue virus serotypes and/or provide a subject with protective immunity to at least one dengue virus of at least one, two, three, or four dengue virus serotypes.

In another aspect, the invention provides a vaccine comprising at least one polypeptide of the invention (or polynucleotide of the invention) in an amount effective to provide a subject with protective immunity to at least one dengue virus of at least one, two, three, or four virus serotypes, and a pharmaceutically acceptable diluent, carrier, or excipient.

Also provided are methods of producing antibodies to at least one dengue virus of at least one serotype in a subject which comprise administering to said subject at least one nucleic acid or polypeptide of the invention, or a combination of both. Also included are methods of producing one or more antibodies that bind to at least one dengue virus of at least one serotype which comprise administering an effective amount of a polypeptide and/or nucleic acid of the invention, or a composition of either or both, to a population of cells such that the cells produce one or more antibodies that bind to at least one dengue virus of at least one serotype.

The invention further provides a method of producing a protective immune response against at least one dengue virus of each of at least one, two, three or four dengue virus serotypes in a subject, wherein the method comprises administering to the subject an amount effective of at least one nucleic acid of the invention sufficient to produce a protective immune response against challenge by the at least one dengue virus of each of the at least one, two, three, or four serotypes, respectively. For some such methods, the immune response is a protective antibody response, such that when said at least one nucleic acid is expressed, antibodies to at least one dengue virus of each of at least one, two, three, or four serotypes are generated in the subject at a level sufficient to produce a protective antibody response against challenge by the at least one dengue virus of each of the at least one, two, three of four serotypes, respectively.

In another aspect, the invention provides a method of inducing an immune response in a subject, such as a mammal, to at least one dengue virus of at least one serotype comprising administering an effective amount of at least one polypeptide of the invention, or an effective amount of a composition thereof, or both, to a subject. The invention also provides a method of inducing an immune response in a subject to at least one dengue virus of at least one serotype comprising administering an effective amount of at least one nucleic acid of the invention, or an effective amount of a composition thereof, or both, to a subject. The effective amount is usually an immunogenic or antigenic amount that facilitates induces a protective immune response or therapeutic or prophylactic treatment.

In addition, the invention provides a method of promoting an immune response in a subject to at least one dengue virus of at least one serotype comprising introducing at least one nucleic acid or polypeptide of the invention into a population of cells and delivering (e.g., implanting) the cells in a subject. The population of cells may initially have been obtained from the subject before introduction of the nucleic acid or polypeptide.

In another aspect, the invention provides a target nucleic acid which, but for the degeneracy of the genetic code, hybridizes under at least stringent conditions to a unique coding oligonucleotide which encodes a unique subsequence in a polypeptide selected from: SEQ ID NOS:65-116, wherein the unique subsequence is unique as compared to a known dengue virus antigen envelope polypeptide, any polypeptide selected from SEQ ID NOS:149-150, or a polypeptide encoded by any of SEQ ID NOS:231-234.

In another aspect, the invention provides a recombinant or synthetic polypeptide comprising an amino acid sequence that has at least about 90% amino acid sequence identity to an amino acid sequence comprising a polypeptide fragment of at least one polypeptide sequence selected from SEQ ID NOS:236-253, wherein the polypeptide fragment does not include the first 16 amino acid residues of the selected polypeptide sequence of SEQ IDS:236-253, and wherein the recombinant or synthetic polypeptide induces an immune response in a subject or cells of the subject against at least one dengue virus of each of at least two serotypes selected from the group of dengue-1, dengue-2, dengue-3, and dengue-4 that is about equal to or greater than an immune response induced in the subject or cells thereof against the at least one dengue virus of the at least two serotypes by a wild-type envelope (E) protein of at least one dengue virus of each of the at least two serotypes selected from SEQ ID NO:338, 339, 340, and 341, respectively.

Also included is a recombinant or synthetic polypeptide comprising an amino acid sequence that has at least about 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more amino acid sequence identity to an amino acid sequence of at least one of SEQ ID NOS:1-49 and 153-155, wherein the recombinant or synthetic polypeptide induces an immune response in a subject or a population of cells thereof against at least one dengue virus of each of at least two serotypes selected from the group of dengue-1, dengue-2, dengue-3, and dengue-4 that is about equal to or greater than that induced by a WT E protein of each of said at least one dengue virus, respectively, wherein said WT E protein has an amino acid sequence length substantially equivalent or identical to that of the recombinant or synthetic polypeptide.

In another aspect, the invention provides a recombinant or synthetic polypeptide comprising an amino acid sequence that has at least about 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more amino acid sequence identity to an amino acid sequence of at least one of SEQ ID NOS:1-49 and 153-155, wherein the recombinant or synthetic polypeptide induces an immune response in a subject, e.g., mammal, or cells thereof, against at least one dengue virus of each of at least two serotypes selected from the group of dengue-1, dengue-2, dengue-3, and dengue-4 that is about equal to or greater than an immune response induced in the subject, or cells thereof, against each said at least one dengue virus of each of the at least two dengue serotypes induced by a WT truncated E protein of each of dengue-1, dengue-2, dengue-3, and dengue-4, respectively, wherein said WT truncated E protein has an amino acid sequence length substantially equivalent or identical to that of the recombinant or synthetic polypeptide.

In yet another aspect, the invention provides a recombinant or synthetic polypeptide comprising an amino acid sequence that has at least about 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more amino acid sequence identity to the amino acid sequence of at least one of SEQ ID NOS:65-116, wherein the recombinant or synthetic polypeptide induces an immune response in a subject or population of cells thereof, e.g., mammal or population of mammalian cells, against at least one dengue virus of each of at least two serotypes selected from the group of DEN-1, DEN-2, DEN-3, and DEN-4 that is about equal to or greater than an immune response induced in the subject (e.g., mammal or population of mammalian cells) against each said at least one dengue virus of each of the at least two serotypes by a WT PRM15/truncated E protein of each of DEN-1, DEN-2, DEN-3, and DEN-4, respectively, wherein each said WT PRM15/truncated E protein polypeptide is selected from SEQ ID NOS:149-152.

The invention further provides a recombinant or synthetic polypeptide comprising an amino acid sequence that has at least about 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more amino acid sequence identity to an amino acid sequence of at least one of SEQ ID NOS:139-148, 236-253, 343, and 345, wherein the recombinant or synthetic polypeptide induces an immune response in a subject or population of cells thereof against at least one dengue virus of each of at least two serotypes selected from the group of DEN-1, -2, -3, and -4 virus that is about equal to or greater than an immune response induced in the mammalian cell against the at least one dengue virus of each of the at least two serotypes by a WT C15/full length prM/full length E fusion protein of each of said at least two serotypes, wherein each said WT C15/full length prM/full length E fusion protein is selected from SEQ ID NOS:227-230.

Also included is a protein aggregate formed from a population of at least two recombinant or synthetic polypeptides of the invention. Such protein aggregate includes dimers, trimers, etc. and other multiples of the polypeptides of the invention, which polypeptides need not be identical.

Included are virus-like particles comprising of at least two polypeptides of the invention, which polypeptides need not be identical. Some such virus-like particles are formed from a population of at least two polypeptides selected from among the recombinant, mutant, or synthetic polypeptides. Some such virus-like particles are formed from expression of one or more nucleic acids encoding at least two polypeptides of the invention.

Also provided are viruses, including, e.g., attenuated viruses, comprising at least one of: (1) a nucleic acid of the invention; (2) a polypeptide of the invention; and/or (3) a vector of the invention. The virus may comprise a yellow fever (YF) virus that has been modified with a nucleic acid, polypeptide, and/or vector of the invention. Included is a chimeric virus comprising a dengue virus (e.g., DEN-2 or DEN-4) that comprises at least one polypeptide of the invention in place of or in addition to the respective dengue virus full length or truncated E protein, the respective dengue virus full length prM protein or fragment thereof (e.g., PRM15), and/or the respective dengue virus fusion protein comprising the native dengue virus C15/full length prM/full length envelope protein. Included is an attenuated or replication-deficient chimeric flavivirus (e.g., dengue virus or YF virus) or adenovirus, comprising at least one nucleic acid or polypeptide of the invention in place of the corresponding nucleic acid or polypeptide of the flavivirus or adenovirus, respectively.

Also provided is a DNA or RNA construct or a viable chimeric recombinant flavivirus, said DNA or RNA construct or chimeric recombinant flavivirus comprising a first region of nucleic acid encoding a recombinant protein(s) (e.g., PRM15/truncE, C15/full prM/full E, or prM & E proteins) of the invention) operably linked to a 2nd region of nucleic acid encoding non-structural proteins of a flavivirus (e.g., YF virus or DEN-2, DEN-4).

The invention also includes an integrated system comprising a computer or computer readable medium comprising a database comprising at least one sequence record, each said at least one sequence record comprising at least one character string corresponding to at least one polypeptide sequence selected from the group of SEQ ID NOS:1-49, 65-116, 139-148, 153-155, 236-253, 343, and 345, or at least nucleic acid sequence selected from the group of SEQ ID NOS:156-210, 235, 254-271, 285-330, 342, and 344, the integrated system further comprising a user input interface allowing a user to selectively view said at least one sequence record.

Also provided is a method of using a computer system to present information pertaining to at least one of a plurality of sequence records stored in a database, the sequence records each comprising at least one character string corresponding to SEQ ID NOS:1-49, 65-116, 139-148, 153-210, 235-271, 285-330, 342-345, the method comprising: (a) determining a list of at least one character string corresponding to at least one of the group of SEQ ID NOS:1-49, 65-116, 139-148, 153-210, 235-271, 285-330, 342-345, or a subsequence thereof; (b) determining which said at least one character string of the list is selected by a user; and (c) displaying each selected character string, or aligning each selected character string with an additional character string.

The invention further provides improved and novel methods and techniques for detecting and/or diagnosing the presence of antibodies against one, two, three or four dengue virus serotypes in a sample (including, e.g., a biological sample, such as a serum sample obtained from a subject, such as a animal, including, e.g., a mammal, including, e.g., a human at risk for dengue virus infection). The invention also provides methods and techniques for simultaneously detecting and/or diagnosing the presence of antibodies against one, two, three or four dengue virus serotypes in a single sample. The polypeptides of the invention, and the nucleic acids encoding them, can be used in this respect to detect or diagnose a biological sample for the presence of such antibodies. Advantageously, a composition comprising as little as about 10 μl of the aspirated supernatant of a cell culture transfected with a polynucleotide of the invention can serve as a uniform and suitable substrate for the simultaneous diagnosis or detection of antibodies against all four serotypes of dengue virus in a sample, such as, e.g., a biological sample, including a sample obtained from a subject, such as a mammal, including, e.g., human, at risk for dengue virus infection. The biological sample can be, e.g., a blood serum sample obtained from a subject.

In another aspect, the invention provides a method of diagnosing a sample for, or detecting in a sample, the presence of one or more antibodies that bind to at least one dengue virus of at least one serotype, the method comprising: (a) contacting the sample with at least one polypeptide of the invention under conditions such that if the sample comprises one or more antibodies that bind to the at one dengue virus, at least one anti-dengue virus antibody binds to the at least one polypeptide to form a mixed composition; (b) contacting the mixed composition with at least one affinity molecule that binds to an anti-dengue virus antibody; (c) removing unbound affinity-molecule from the mixed composition; and (d) diagnosing or detecting the presence or absence of one or more affinity molecules, wherein the presence of one or more affinity molecules is indicative of the presence of one or more antibodies that bind to the at least one dengue virus in the sample.

In addition, the invention provides a method of diagnosing a sample for, or detecting in a sample, the presence of at least one antibody that binds to a dengue virus of at least one serotype, said method comprising: (a) contacting the sample with at least one polypeptide of the invention under conditions such that if the sample comprises one or more antibodies that bind to the dengue virus, at least one anti-dengue virus antibody binds to the at least one polypeptide to form at least one antibody-polypeptide complex, and (b) diagnosing or detecting the presence or absence of the at least one antibody-polypeptide complex, wherein the presence of the at least one antibody-polypeptide complex is indicative of the presence of at least one antibody that binds to the dengue virus in the sample.

Methods of producing the above-described nucleic acids and polypeptides also are provided by the present invention. In general, the polynucleotides of the invention are advantageously prepared by standard nucleic acid synthesis techniques (e.g., polymerase chain reaction (PCR)-facilitated overlapping nucleic acid assembly). However, the invention also provides a method of performing recursive sequence recombination with selected nucleic acids of the invention and appropriate screening techniques to generate and identify novel dengue antigen-encoding nucleic acids. Methods of making polypeptides of the invention include the transfection or infection of cells, either in vitro, in vivo, or ex vivo, with the polynucleotides and/or vectors of the invention, in addition to standard synthetic protein synthesis techniques.

Numerous alternative, additional and/or more particular polynucleotides, polypeptides, fusion proteins, viruses, vectors, cells, cell cultures, compositions, transgenic animals, transdermal patches, methods of using nucleic acids and/or polypeptides of the invention, diagnostic and detection assays, methods and techniques provided by the invention are set forth below.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a map of a representative pMaxVax10.1 vector.

FIG. 2 illustrates an exemplary pMaxVax10.1 vector comprising an insertion of a nucleic acid that encodes a fusion protein comprising a PRM15 signal peptide/truncated E protein of a wild-type DEN-2 virus. A pMaxVax10.1 vector may be similarly constructed with a nucleic acid that encodes a fusion protein comprising a PRM15 signal peptide/truncated E protein of a WT DEN-1, DEN-3, or DEN-4 virus, or a variant thereof, in substituted for the nucleic acid sequence encoding the PRM15 signal peptide/truncated E protein of WT DEN-2 (as shown in FIG. 2). The truncated E protein of DEN-2 polyprotein comprised a polypeptide sequence comprising about 90% of the contiguous amino acid residues of the sequence of the DEN-2 E protein as measured in sequence order from the N terminus of the E protein, as explained in detail below. The term “truncated E protein” is abbreviated as “tE” or “tE protein.” The invention also provides a pMaxVax10.1 vector comprising a nucleic acid of the invention that encodes a recombinant or synthetic dengue virus antigen of the invention; such nucleic acid is similarly substituted into the pMaxVax10.1 vector in the same position as the nucleic acid encoding the DEN-2 PRM15/truncated E protein polypeptide shown in FIG. 2.

FIG. 3 displays a Western blot of dengue virus antigens, each comprising a PRM15 signal peptide/truncated (t) E protein, expressed in the lysate (L) and secreted into the medium supernatant (SN) from 293 cells transfected with one of the following vectors: pMaxVax10.1_(Den-1PRM15/tE CO), pMaxVax10.1_(Den-2PRM15/tE CO), pMaxVax10.1_(Den-3PRM15/tE CO), and pMaxVax10.1_(Den-4PRM15/tE CO). The nitrocellulose membrane bound proteins were incubated with antibodies from mouse ascitic fluid for DEN-1, DEN-2, DEN-3, and DEN-4.

FIG. 4 includes a series of dot blots obtained by FACS analysis of 293 cells transfected with either pMaxVax10.1_(2/7) vector or pMaxVax10.1_(Den-3PRM15/tE CO) vector and separately incubated with mouse anti-DEN-1, DEN-2, DEN-3, or DEN-4 antisera.

FIG. 5 is a Western blot of recombinant PRM15/truncated E dengue virus antigens of the invention expressed from a pMaxVax10.1 DNA vector, compared to PRM15/truncated E antigens having wild-type (wt) amino acid sequences expressed from pMaxVax10.1_(Den-1PRM15/tE CO), pMaxVax10.1_(Den-2PRM15/tE CO), pMaxVax10.1_(Den-3PRM15/tE CO), and pMaxVax10.1_(Den-4PRM15/tE CO) vectors, bound with DEN-1, DEN-3, and DEN-4 murine antisera and appropriate secondary antibodies. Nucleotide sequences encoding these recombinant PRM15/tE dengue virus antigens were screened or selected from among recombinant nucleotide sequences in recombinant libraries produced by recursive sequence recombination. In the figure, the letter “C” refers to a control vector and represents the data obtained by transfecting 293 cells with pMaxVax10.1_(null) vector that does not include a dengue virus nucleotide sequence.

FIG. 6 shows the optical density (OD) values obtained by ELISA analysis of antisera obtained from mice, each of which had been injected with a pMaxVax10.1 vector comprising a representative recombinant (e.g., shuffled) nucleic acid sequence identified in libraries of recombinant nucleic acids using ELISA plates, coated with inactivated DEN-1, DEN-2, DEN-3, and DEN-4 viruses. The term “WT” refers to a pMaxVax10.1 vector comprising a nucleic acid sequence encoding PRM15/truncated envelope protein of a wild-type DEN-1, DEN-2, DEN-3, or DEN-4 polyprotein, respectively. The control vector comprises a pMaxVax10.1_(null) vector (termed “pMV”) lacking any dengue virus nucleic acid sequence.

FIG. 7 presents a comparison of ELISA OD values for antisera obtained from blood of mice-injected with a pMaxVax10.1 vector comprising a parental dengue virus PRM15/tE nucleic acid (encoding a WT DEN-1 DEN-2, DEN-3, or DEN-4 polypeptide sequence) or a select representative recombinant PRM15/tE nucleic acid encoding a recombinant PRM15/tE dengue virus antigen (Ag). Results using antisera obtained from mice injected with a mixture of four pMaxVax10.1 vectors, each vector including a nucleotide sequence encoding one of the four WT DEN-1PRM15/tE, DEN-2PRM15/tE, DEN-3PRM15/tE, and DEN-4PRM15/tE antigens (i.e., pMaxVax10.1_(DEN-1PRM15/tE CO), pMaxVax10.1_(DEN-2PRM15/tE CO), pMaxVax10.1_(DEN-3PRM15/tE CO), and pMaxVax10.1_(DEN-41PRM15/tE CO)), are also shown. The control (“pMV”) is a pMaxVax10.1 vector that lacks a parental or recombinant PRM15/tE nucleic acid.

FIG. 8A illustrates the results of reciprocal 50% plaque reduction neutralization titers (PRNT) for sera obtained from mice at day 76 after the initial injection at day 0 with a representative pMaxVax10.1 DNA plasmid vector comprising either one of the four WT DEN-1PRM15/tE, DEN-2PRM15/tE, DEN-3PRM15/tE, and DEN-4PRM15/tE antigens, or a mix of these 4 wild-type antigens, or a recombinant nucleotide sequence of the invention corresponding to one of the following antigens: 18E9, 18D7, 16G11, 18H2, 16B4, 6E12, 2G11, 2/7, 15D4, and 18H6. FIG. 8B illustrates the results of reciprocal 50% plaque reduction neutralization titers (PRNT) for sera obtained from mice at day 76 after the initial injection at day 0 with a representative pMaxVax10.1 DNA plasmid vector comprising either one of the four WT DEN-1, DEN-2, DEN-3, and DEN-4 antigens in the C15/full prM/full E antigen format, or a mix of these 4 wild-type antigens, or a recombinant nucleotide sequence of the invention corresponding to one of the following C15/full prM/full E antigens: 5/21-D1, 2G11-D4, and 6E12-D4.

FIG. 9 shows a Western blot analysis of nine secreted recombinant dengue virus polypeptide antigens of the invention, each encoded by a recombinant polynucleotide of the invention, that induced a neutralizing antibody response against wild-type dengue viruses of at least two dengue virus serotypes in vivo. “E” refers to envelope protein, and the arrow to the right of the blot indicates the position of the E protein in the blot. For each recombinant antigen, the number of different serotypes neutralized by the antibody response induced by the antigen is shown. The PRNT results are shown in FIGS. 8A and 8B.

FIG. 10 provides OD graphs obtained by ELISA analyses of antisera obtained from mice, each of which had been injected with a pMaxVax10.1 vector comprising a recombinant nucleotide sequence. The recombinant nucleotide sequences were generated by recursive sequence recombination of the human codon optimized dengue virus sequences (e.g., SEQ ID NOS:215-218). Each such recombinant nucleotide sequence encoded a recombinant dengue virus antigen having the following format: C15 signal sequence/full length prM protein/full length E protein. A pMaxVax10.1_(null) vector (“pMV”) served as the control vector.

FIG. 11 shows the results of a sequence diversity analysis (e.g., chimerism) of amino acid sequences of representative recombinant PRM15/truncated E dengue virus antigens of the invention that are encoded by recombinant nucleic acids of the invention, as compared with sequences of the parental WT DEN-1 PRM15/tE, DEN-2 PRM15/tE, DEN-3 PRM15/tE, and DEN-4 PRM15/tE proteins. This analysis confirms that these recombinant antigens include amino acid fragments or segments from all four parental WT protein sequences and thus constitute chimeras of the parental sequences.

FIG. 12 is a graph showing the percentage of mice that have survived challenge with DEN-2 virus over a period of 28 days following immunization with a recombinant pMaxVax10.1 vector of the invention. In this experiment, a recombinant pMaxVax10.1 vector comprising one of the following three recombinant PRM15/tE nucleic acid sequences of the invention and a mixture of these three nucleic acid sequences were evaluated: SEQ ID NOS:235 (18H6), SEQ ID NOS:204 (2G11-D4) and SEQ ID NOS: 202 (6E12-D4). For comparison, some mice were injected with a pMaxVax10.1 vector comprising a WT DEN-2 PRM15/tE nucleic acid sequence and WT DEN-2 C15/full prM/full E, or a WT DEN-3 C15/full prM/full E nucleic acid sequence, as well as a mix of all four DEN-1-4 WT PRM15/tE and C15/full prM/full E. Mice injected with PBS or a pMaxVax10.1_(null) vector served as control mice.

FIGS. 13A and 13B show dot blots of 10 μl to 100 μl of supernatant from 293 cells transfected with representative nucleic acids of the invention (e.g., 5/21 (SEQ ID NO:157), 2/7 (SEQ ID NO:156), 6E12 (SEQ ID NO:159) and 2G11 (SEQ ID NO:157) when reacted with mouse DEN-1, DEN-2, DEN-3, and DEN-4 antisera.

FIG. 14 is an exemplary graphical representation of a wild-type dengue virus gene and dengue virus structural proteins encoded therefrom (capsid, prM, and E proteins).

FIGS. 15A and 15B show DEN-3 WT in C15/full prM/full E protein format and selected recombinant polypeptides of the invention (18H6 in PRM15/truncated E format and 16G11-25B10 in C15/full prM/full E format) purified by centrifugation through 20%-60% sucrose gradients. The purified polypeptides are either stained with Coomassie Blue (15A) in a polyacrylamide (PAA) gel or with DEN-specific antibodies after Western Blot transfer to nitrocellulose filters (15B) using standard techniques (see, e.g., Rapley, R. and Walker, J. M. eds., MOLECULAR BIOMETHODS HANDBOOK (1998), Humana Press, Inc., Tijssen (1993) LABORATORY TECHNIQUES IN BIOCHEMISTRY AND MOLECULAR BIOLOGY—HYBRIDIZATION WITH NUCLEIC ACID PROBES [hereinafter Rapley and Walker, MOLECULAR BIOMETHODS HANBBOOK]. E represents “envelope” protein. Bovine serum albumin (BSA) was used as a control in the gel and Western Blot. Bands suggesting formation of a dimer polypeptide for each of DEN-3 C15/full prM/full E and 16G11-25B10 are shown.

FIG. 16 presents a comparison of ELISA OD values for antisera obtained from blood of mice immunized by injection with either a pMaxVax10.1 vector encoding one of the following recombinant polypeptide antigens of the invention (18H6 (SEQ ID NO:235) or 16G11-25B10 (SEQ ID NO:255)) or by sucrose gradient purified polypeptides of the invention (18H6 (SEQ ID NO:110) or 16G11-25B10 (SEQ ID NO:251)) in alum adjuvant. The mice received 2 booster immunizations in 3-week intervals, either with a pMaxVax10.1 vector encoding one of the following recombinant polypeptide antigens of the invention (e.g., DNA+DNA), or with polypeptides in alum after initial injection with DNA (e.g., DNA+Protein) or protein (e.g., Protein+Protein). “Prot.” represents “protein.”

FIG. 17 presents a comparison of ELISA optical density values for antisera obtained at different time points (days 0, 28, 56, and 112) from blood of 6 monkeys per group, individually injected at days 0, 28, and 84 with: (1) a mixture comprising four pMaxVax10.1 nucleic acid vectors in PBS, each vector comprising one of four parental WT dengue virus C15/full length prM/full length E nucleic acids (4WT, Mix) encoding a WT DEN-1 DEN-2, DEN-3, or DEN-4 C15/full length prM/full length E polypeptide, respectively; (2) a pMaxVax10.1 vector comprising a representative recombinant PRM15/tE nuclei acid (18H6) that encodes a recombinant PRM15/tE dengue virus antigen; (3) a pMaxVax10.1 vector comprising a representative PRM C15/full length prM/full length E nucleic acid (6E12-D4 and 2G11-D4), each of which encodes a C15/full length prM/full length E dengue virus antigen; or (4) a mixture of three pMaxVax10.1 vectors in PBS, each vector comprising a nucleic acid corresponding to one of 18H6, 6E12-D4, and 2G11-D4 (3Sh, Mix). The control (“pMV ctrl”) is a pMaxVax10.1 vector that lacks a parental or recombinant PRM15/tE or C15/full length prM/full length E nucleic acid. All nucleic acid vectors were administered as compositions in PBS.

DETAILED DESCRIPTION OF THE INVENTION

The invention provides novel recombinant, synthetic, mutant, and/or isolated polypeptides, fusion proteins, antibodies, nucleic acids (e.g., polynucleotides), viruses, virus-like particles; vectors comprising such nucleic acids and/or encoding such polypeptides, fusion proteins, antibodies, viruses, and virus-like particles; cells and compositions comprising such nucleic acids, polypeptides, fusion proteins, antibodies, viruses, virus-like particles and/or vectors; and vaccines comprising one or more of the aforementioned nucleic acids, polypeptides, fusion proteins, antibodies, viruses, virus-like particles, vectors, cells, and compositions of the invention. The invention further provides methods of making and methods of using such nucleic acids, polypeptides, fusion proteins, antibodies, viruses, virus-like particles, vectors, cells, and compositions of the invention.

In one aspect, the nucleic acids, polypeptides, fusion proteins, vectors, viruses, virus-like particles, cells, antibodies and compositions are generally useful in modulating, promoting, inducing, and/or enhancing an immune response(s) to one or more flaviviruses and/or one or more dengue viruses of one or more serotypes, including, but not limited to, e.g., dengue-1, dengue-2, dengue-3, and dengue-4, or variants thereof, and/or analyzing biological samples for the presence of anti-flavivirus antibodies against at least one flavivirus of at least one flavivirus serotype or variant thereof, and more particularly anti-dengue virus antibodies against at least one of DEN-1 DEN-2, DEN-3, and DEN-4 viruses or a variant thereof. The nucleic acids, polypeptides, fusion proteins, vectors, viruses, virus-like particles, antibodies, cells, compositions, and methods of the invention described herein are also believed useful in in vivo methods for the prophylactic and/or therapeutic treatment of animals (including, e.g., vertebrates and mammals) of a disease(s) associated with at least one flavivirus of at least one serotype or a variant thereof (including, e.g., at least one dengue virus of at least one serotype or variant thereof), and in methods for the in vitro, ex vivo, and/or in vivo diagnosis, detection, and/or identification at least one flavivirus or variant thereof (including, e.g., at least one dengue virus of at least one serotype or variant thereof).

In one aspect, the nucleic acids, polypeptides, fusion proteins, vectors, viruses, virus-like particles, antibodies, cells, compositions, and methods of the invention are particularly useful in in vivo, and/or ex vivo methods of inducing or enhancing an immune response in an animal to at least one virus of the Flaviviridae family of viruses (e.g., a member of the Flavivirus genus such as Japanese encephalitis virus) and methods for the prophylactic and/or therapeutic treatment of animals (including, e.g., vertebrates and mammals) of a disease(s) associated with at least one virus that is a member of the Flaviviridae family of viruses, which includes flaviviruses, pestiviruses, and hepaciviruses, or variant of any such virus thereof, preferably wherein said at least one virus or variant thereof is a virus that is related to at least one dengue virus of at least one serotype (including, e.g., but not limited to, a virus of the Flaviviridae family, such as a member of the Flavivirus genus or another flavivirus, including, e.g., a yellow fever virus, St. Louis encephalitis virus, Japanese encephalitis virus, tick-borne encephalitis virus, Murray Valley encephalitis virus, Russian spring-summer encephalitis virus, and/or West Nile virus, and including, e.g., those viruses that are described as being related to dengue viruses in FIELDS VIROLOGY, supra, Vol. 1, Chapters 32 and 33 (4^(th) ed.), or any variant of any such virus).

In another aspect, the nucleic acids, polypeptides, fusion proteins, vectors, viruses, virus-like particles, antibodies, cells, compositions, and methods of the invention are useful in methods for the in vitro, ex vivo, and/or in vivo diagnosis, detection, and/or identification at least one virus of the Flaviviridae family (e.g., which consists of three genera, flavivirus, pestivirus, and hepacivirus) or a variant of any such virus thereof, preferably wherein said at least one virus or variant thereof is a virus that is related to at least one dengue virus (including, e.g., but not limited to, a virus of the Flaviviridae family, another flavivirus, yellow fever viruses, St. Louis encephalitis viruses, Japanese encephalitis viruses, tick-borne encephalitis viruses, Murray Valley encephalitis virus, Russian spring-summer encephalitis virus, and/or West Nile viruses, and including, e.g., those viruses that are described as being related to dengue viruses in FIELDS VIROLOGY, supra, Vol. 1, Chapters 32 and 33 (4^(th) ed.), or any variant of any such virus). As such, it will be understood that a reference to a dengue virus in the following detailed description can be construed as relating more generally to a virus of the Flaviviridae virus family, including a flavivirus, pestivirus, or hepacivirus, particularly to member of the Flavivirus genus, preferably to a flavivirus of at least one serotype, and especially to a flavivirus that is related or, preferably, closely related to at least one dengue virus of at least one serotype, unless otherwise stated or clearly contradicted by context. It will be understood that a reference to a polypeptide in the following detailed description can be construed, depending upon the context, as including a fusion protein.

The polypeptides, fusion proteins, nucleic acids, vectors, viruses, virus-like particles, compositions, cells, and methods of the invention are also useful in in vivo and/or ex vivo methods of inducing an immune response in animals and/or in in vivo and/or ex vivo methods of immunization of animals (including, e.g., vertebrates and mammals) against at least one virus of the Flaviviridae family, e.g., flavivirus, or variant thereof (including, e.g., a flavivirus that comprises or is related to at least one dengue virus of at least one dengue virus serotype or a variant thereof), and/or as a vaccine against at least one virus of the Flaviviridae family, e.g., flavivirus, or variant thereof, and/or more particularly as a vaccine against a virus of the Flaviviridae family, e.g., flavivirus, that comprises or is related to at least one dengue virus or variant thereof.

Advantageously, recombinant, synthetic, mutant, and/or isolated polypeptides and/or fusion proteins provided by the invention comprise an amino acid sequence that is capable of modulating, inducing, promoting, and/or enhancing a detectable immune response(s), such as the production of antibodies that bind to at least one virus of the Flaviviridae family, e.g., flavivirus (e.g., dengue virus), and/or the production of at least one type of antigen relating to the Flaviviridae family, e.g., flavivirus antigen (including, e.g., at least one type of dengue virus antigen) in animal cells (typically vertebrate cells, and more typically and preferably, mammalian cells, such as human and nonhuman primate cells) in vitro in cell culture and/or ex vivo and/or in vivo in a subject or tissue or cells obtained therefrom. Such an amino acid sequence portion of the polypeptide or fusion protein can be referred to as an “immunogenic amino acid” or “antigenic amino acid” or simply “the amino acid” or an “immunogen” or “antigen” of the invention.

A further desirable feature of the recombinant, synthetic, mutant, and/or isolated polypeptides of the invention, and the respective polynucleotides of the invention that encode such polypeptides, is the ability to induce, promote, modulate, and/or enhance an immune response(s) to at least one flavivirus, including, e.g., to at least one dengue virus of at least one serotype, and preferably to at least one dengue virus of each of at least two serotypes, more preferably to at least one dengue virus of each of at least three serotypes, and even more preferably to at least one dengue virus of each of at least four known virus dengue virus serotypes (e.g., DEN-1, DEN-2, DEN-3, and/or DEN-4).

A further desirable feature of the recombinant synthetic, mutant, and/or isolated polypeptides of the invention, and the polynucleotides that encode such polypeptides, is the ability to induce, promote, enhance or modulate a neutralizing antibody response(s) against at least one flavivirus, preferably against at least one dengue virus of at least one serotype, more preferably against at least one dengue virus of each of at least two serotypes, even more preferably against at least one dengue virus of each of at least three serotypes, and most preferably against at least one dengue virus of each of at least the four known virus dengue virus serotypes (e.g., DEN-1, DEN-2, DEN-3, and/or DEN-4).

More particular and desirable characteristics of the recombinant, synthetic, mutant, and/or isolated polypeptides and/or fusion proteins of the invention, polynucleotides encoding these and other polypeptides and fusion proteins, and related cells, antibodies, vectors, compositions, diagnostic assays and methods of making and using such polypeptides and polynucleotides are described in detail herein.

The term “PRM15” (or “prM15” and sometimes alternatively referred to as “spM”) when used with reference to an amino acid generally refers to an amino acid sequence typically comprising about 15 amino acid residues. In some embodiments, the PRM15 amino acid sequence includes, in addition, a methionine (“Met” or “M”) as the first amino acid residue; in such embodiments, the PRM15 sequence comprises a total of about 16 amino acids. Usually, a PRM15 amino acid sequence comprises the last 15 amino acid residues of the C terminus of a wild-type (WT), recombinant, mutant, or synthetic dengue virus prM protein, or variant thereof; in some such embodiments, the PRM15 amino acid sequence further comprises a methionine as the first amino acid residue of the sequence and thus is about 16 amino acids in length.

In reference to a nucleotide sequence, “PRM15” generally refers to a nucleotide sequence typically comprising about 15 amino acid residues. In some embodiments, the PRM15 nucleotide sequence includes three nucleotides residues that encode a methionine, which three residues are positioned as the first three residues of the sequence, followed in sequence order by the codons encoding the remaining approximately 15 amino acids. Usually, a PRM15 nucleotide sequence comprises nucleic acid residues that encode approximately the last 15 amino acids of the C terminus of a WT, recombinant, mutant, or synthetic dengue virus prM protein or variant thereof. In some such embodiments, the PRM15 nucleotide sequence further includes three residues that encode a methionine, which are positioned at the beginning of the nucleotide sequence; in such embodiments, the nucleotide sequence encodes a total of about 16 amino acids.

The PRM15 amino acid sequence is typically linked to a polypeptide and effectively acts as a signal sequence for transport of such polypeptide in a cell. The PRM15 signal sequence is usually cleaved from the polypeptide subsequently during processing. For example, in some embodiments, the PRM15 sequence is linked to a dengue virus envelope (“E” or “Env”) protein, such as a WT, recombinant, mutant, or synthetic dengue virus E protein or to an antigenic or immunogen-encoding fragment (e.g., truncated recombinant, WT, or synthetic E protein) or variant thereof.

A variety of PRM15 amino acid signal sequences are described herein (see, e.g., SEQ ID NOS:52-64). A variety of PRM15 nucleotide sequences are also described herein (see, e.g., SEQ ID NOS:272-284). A PRM15 amino acid sequence can be readily determined from a prM protein sequence by identifying, e.g., approximately the last 15 amino acid residues of the C terminus of the prM sequence. If desired, a Met residue can be added at the beginning of the amino acid sequence. A PRM15 nucleic acid sequence can be similarly determined by identifying the nucleic acid residues that encode approximately the last 15 amino acid residues of the C terminus of a particular prM sequence; if desired, a nucleic acid codon encoding a Met residue can be similarly added as the first codon of such nucleic acid sequence. PRM15 amino acid and nucleic acid sequences can be synthesized by using standard protein and nucleic acid synthesis techniques, respectively, as would be know to one of skill in the art and as described below.

In one aspect, the invention includes a chimeric polypeptide comprising a sequence that has at least about 90% amino acid sequence identity to a polypeptide sequence comprising from amino acid residue 16 to the last amino acid residue of the amino acid sequence of a synthetic or recombinant C15/full prM/full E or PRM15/truncated E polypeptide of the invention, and nucleic acid encoding said chimeric polypeptide.

The term “tE” (or “ETRUNC” or “E-truncated” or the like) when used with reference to an amino acid sequence refers to a truncated (“t” or “trunc”) envelope (E) protein of a flavivirus. Such flavivirus can be an isolated wild-type, recombinant, mutant, or synthetic flavivirus or variant thereof. In one aspect, the flavivirus comprises a dengue virus, and the truncated E protein comprises a truncated wild-type dengue virus E protein or a variant thereof, or a truncated recombinant, mutant, or synthetic dengue virus E protein. Compared to a non-truncated E protein, a truncated E protein lacks one or more amino acid residues of the C terminus of the non-truncated form of the E protein. When used with reference to a nucleic acid sequence, the term “tE” (or “ETRUNC” or “E-truncated” or the like) refers to a truncated nucleic acid sequence that corresponds to or encodes a truncated E protein. A nucleic acid sequence that encodes a truncated E protein similarly lacks one or more nucleic acid residues from the C terminus compared to the non-truncated nucleic acid.

In one embodiment, the truncated dengue envelope (E) protein (DEN-1, DEN-2, DEN-3, or DEN-4) comprises a polypeptide sequence comprising from about 70% to about 98% (e.g., about 85%, 86%, 87%, 88%, 90%, 92%, or 95%) of the contiguous amino acid residues of the respective dengue E protein as measured in sequence order from the N terminus amino acid residue of the amino acid sequence of the respective dengue E protein. In other words, the truncated E protein comprises a fragment of a dengue E protein sequence; such fragment lacks the amino acid residues corresponding to from about 2% to about 30% (e.g., about 15%, 14%, 13%, 12%, 10%, 8%, or 5%) of the amino acid residues of the respective full-length dengue E protein sequence, as measured from the C terminus amino acid residue of the full-length dengue E protein sequence. A nucleic acid sequence encoding such a truncated dengue E protein does not include the nucleotide residues encoding from about 2% to about 30% (e.g., about 15%, 14%, 13%, 12%, 10%, 8%, or 5%) of the amino acid residues of the C terminus of the respective full length dengue E protein sequence, as measured from the C terminus amino acid residue of the full length E protein sequence.

In one particular embodiment, the truncated DEN-1 E protein comprises a polypeptide sequence comprising from about 87% of the contiguous amino acid residues of the DEN-1 E protein as measured in sequence order from the N terminus amino acid residue, and the nucleic acid sequence encoding such a truncated DEN-1 E protein excludes nucleotide residues encoding about the last 13% of the amino acid residues of the C terminus of the DEN-1 E protein sequence, as measured from the C terminus amino acid residue.

In one particular embodiment, the truncated DEN-2 E protein comprises a polypeptide sequence comprising from about 90% of the contiguous amino acid residues of the DEN-2 E protein as measured in sequence order from the N terminus amino acid residue, and the nucleic acid sequence encoding such a truncated DEN-2 E protein excludes nucleotide residues encoding about the last 10% of the amino acid residues of the C terminus of the DEN-2 E protein sequence, as measured from the C terminus amino acid residue.

In one particular embodiment, the truncated DEN-3 E protein comprises a polypeptide sequence comprising from about 89% of the contiguous amino acid residues of the DEN-3 E protein as measured in sequence order from the N terminus amino acid residue, and the nucleic acid sequence encoding such a truncated DEN-3 E protein excludes nucleotide residues encoding about the last 11% of the amino acid residues of the C terminus of the DEN-3 E protein sequence, as measured from the C terminus amino acid residue.

In one embodiment, the truncated DEN-4 E protein comprises a polypeptide sequence comprising from about 90% of the contiguous amino acid residues of the DEN-4 E protein as measured in sequence order from the N terminus amino acid residue, and the nucleic acid sequence encoding such a truncated DEN-4 E protein excludes nucleotide residues encoding about the last 10% of the amino acid residues of the C terminus of the DEN-4 E protein sequence, as measured from the C terminus amino acid residue.

The term “CO” refers to “codon optimization” or a “codon optimized” sequence. When used with reference to a nucleic acid sequence, the term refers to a codon optimized nucleic acid sequence. When used with reference to an amino acid sequence, the term refers to an amino acid sequence that corresponds to or is encoded by a codon optimized nucleic acid sequence.

The terms “polypeptide,” “protein,” and “peptide,” herein and used throughout synonymously refer to any polymer formed from multiple amino acids associated, at least in part, by covalent bonding (e.g., “protein” as used herein refers both to linear polymers (chains) of amino acids associated by peptide bonds as well as proteins exhibiting secondary, tertiary, or quaternary structure, which can include other forms of intramolecular and intermolecular association, such as hydrogen and van der Waals bonds, within or between peptide chain(s)), unless otherwise stated. The term “polypeptide” and “protein” includes fusion proteins, unless otherwise stated.

The term “recombinant” when used with reference to an amino acid (e.g., peptide, polypeptide, protein, antigen) typically refers to a non-naturally occurring amino acid (i.e., not found in nature) (e.g., non-naturally occurring peptide, polypeptide, protein, antigen). A “recombinant polypeptide” includes any polypeptide expressed or capable of being expressed from a recombinant nucleic acid (however, the recombinant nucleic acid sequence need not include all of the coding or nucleotide sequence elements necessary for expression), or any polypeptide comprising an amino acid sequence, wherein each amino acid residue of the sequence corresponds to or is capable of being encoded by a codon of a nucleic acid sequence, including a recombinant nucleic acid sequence.

The term “recombinant” when used with reference to a nucleic acid (e.g., a polynucleotide or other nucleotide) typically refers to a non-naturally occurring nucleic acid. When used with reference to a nucleic acid, the term “recombinant” may indicate that the nucleic acid has been modified by the introduction of at least one exogenous (i.e., foreign, and typically heterologous) nucleotide or the alteration of at least one native nucleotide component of the nucleic acid. A “recombinant vector” refers to a non-naturally occurring vector, including, e.g., a vector comprising a recombinant nucleic acid sequence.

In one aspect, a “recombinant polynucleotide” or a “recombinant polypeptide” is a non-naturally occurring polynucleotide or polypeptide. A recombinant polynucleotide or recombinant polypeptide may include nucleic acids or amino acids, respectively, from more than one source nucleic acid or polypeptide, which source nucleic acid or polypeptide can be a naturally occurring nucleic acid or polypeptide, or can itself have been subjected to mutagenesis, alteration, recombination, or other type of modification. The source polynucleotides or polypeptides from which the different nucleic acid or amino acid sequences are derived are sometimes homologous (i.e., have, or encode a polypeptide that encodes, the same or a similar structure and/or function), and are often from different isolates, serotypes, strains, species, of organism or from different disease states, for example.

The term “recombinant” when used with reference to a cell indicates that the cell comprises a recombinant molecule, such as a recombinant nucleic acid, recombinant polypeptide, or recombinant vector (e.g., non-naturally occurring nucleic acid, polypeptide, or vector). In one aspect, the term “recombinant” when used with reference to a cell indicates that the cell replicates a heterologous nucleic acid, or expresses a peptide or protein encoded by a heterologous nucleic acid. Recombinant cells can contain genes that are not found within the native (non-recombinant) form of the cell. Recombinant cells can also contain genes found in the native form of the cell wherein the genes are modified and re-introduced into the cell by artificial means. The term also encompasses cells that comprise a nucleic acid endogenous to the cell that has been modified without removing the nucleic acid from the cell; such modifications include, e.g., those obtained by gene replacement, site-specific mutation, and related techniques.

The term “synthetic” in reference to a molecule or component means an artificial or non-naturally occurring molecule or component, respectively. For example, a synthetic polynucleotide is an artificial, non-naturally occurring polynucleotide. Techniques for synthetically producing a molecule, component, or combination thereof, including, e.g., a synthetic polynucleotide, polypeptide, fusion protein, vector, virus, virus-like particle, cell, composition, and the like, are further described herein.

For ease of readability, recombinant, synthetic, mutant and/or variant polypeptides, polynucleotides, fusion proteins, vectors, cells, and antibodies of the invention are often referred to simply as “recombinant” polypeptides, polynucleotides, fusion proteins, vectors, cells, and antibodies, respectively.

The term “nucleic acid” refers to deoxyribonucleotides or ribonucleotides and polymers thereof in either single- or double-stranded form. Unless specifically limited, the term encompasses nucleic acids containing known analogues of natural nucleotides which have similar binding properties as the reference nucleic acid and are metabolized in a manner similar to naturally occurring nucleotides. Unless otherwise indicated, a particular nucleic acid sequence also implicitly encompasses conservatively modified variants thereof (e.g., degenerate codon substitutions) and complementary sequences and as well as the sequence explicitly indicated. Specifically, degenerate codon substitutions may be achieved by generating sequences in which the third position of one or more selected (or all) codons is substituted with mixed-base and/or deoxyinosine residues (Batzer et al. (1991) Nucleic Acid Res 19:5081; Ohtsuka et al. (1985) J Biol Chem 260:2605-2608; Cassol et al. (1992); Rossolini et al. (1994) Mol Cell Probes 8:91-98). The term nucleic acid is used interchangeably with polynucleotide, and (in appropriate contexts) gene, cDNA, and mRNA encoded by a gene.

“Nucleic acid derived from a gene” refers to a nucleic acid for whose synthesis the gene, or a subsequence thereof, has ultimately served as a template. Thus, an mRNA, a cDNA reverse transcribed from an mRNA, an RNA transcribed from that cDNA, a DNA amplified from the cDNA, an RNA transcribed from the amplified DNA, etc., are all derived from the gene and detection of such derived products is indicative of the presence and/or abundance of the original gene and/or gene transcript in a sample.

A nucleic acid is “operably linked” with another nucleic acid sequence when it is placed into a functional relationship with another nucleic acid sequence. For instance, a promoter or enhancer is operably linked to a coding sequence if it increases the transcription of the coding sequence. Operably linked means that the DNA sequences being linked are typically contiguous and, where necessary to join two protein coding regions, contiguous and in reading frame. However, since enhancers generally function when separated from the promoter by several kilobases and intronic sequences may be of variable lengths, some polynucleotide elements may be operably linked but not contiguous.

An “isolated” polypeptide refers to a polypeptide separated from one or more components and/or the environment with which it is normally associated (e.g., other peptides, polypeptides, proteins (including complexes, cellular contaminants, cellular components, etc.), cells, etc.). An “isolated” nucleic acid (or an isolated nucleotide or isolated polynucleotide) refers to a nucleic acid (or nucleotide or polynucleotide) that is isolated from one or more components and/or the environment with which it normally associates. Typically, an isolated nucleic acid refers to a nucleic acid that is not immediately contiguous with one or more nucleic acids with which it is immediately contiguous (i.e., at the 5′ and/or 3′ end) in the sequence from which it is obtained and/or derived.

Typically, isolation of a component (e.g., polypeptide, polynucleotide, fusion protein, vector, cell) renders it the predominant component present in a composition, mixture, or collection of components; i.e., on a molar basis it is more abundant than any other individual species in the composition. For example, isolation of a polypeptide or polynucleotide renders the polypeptide or polynucleotide, respectively, the predominant molecule or species present in a composition, mixture, or collection of molecules. Such a “substantially pure” polypeptide or polynucleotide (or other component), for example, typically forms at least about 50%, at least about 60%, at least about 70%, at least about 80%, at least about 90%, at least about 95%, by weight (typically on a molar basis), of all macromolecular species present in a particular composition. Desirably, the substantially pure polypeptide or polynucleotide exhibits essential homogeneity (i.e., contaminant species cannot be detected in the composition by conventional detection methods). The term “purified” generally denotes that a polynucleotide or polypeptide is free or at least substantially free of other components as determined by standard analytical techniques (e.g., forms a band electrophoretic gel, chromatographic eluate, and/or a media subjected to density gradient centrifugation) and/or forms at least about 80%, at least about 85%, preferably at least about 90%, and more preferably at least about 95%, of the macromolecular species in a particular composition.

The term “subject” as used herein includes, but is not limited to, an organism; an animal, including a mammal, which includes, e.g., a human, non-human primate (e.g., baboon, orangutan, chimpanzee, monkey), mouse, pig, cow, goat, cat, rabbit, rat, guinea pig, hamster, horse, monkey, sheep, or other non-human mammal, and a non-mammal, including, e.g., a non-mammalian vertebrate, such as a bird (e.g., a chicken or duck) or a fish, and a non-mammalian invertebrate.

The term “cytokine” includes, for example, interleukins, interferons, chemokines, hematopoietic growth factors, tumor necrosis factors and transforming growth factors. In general these are small molecular weight proteins that regulate maturation, activation, proliferation, and differentiation of cells of the immune system.

A “variant” of a polypeptide is a polypeptide that differs in one or more amino acid residues from a parent or reference polypeptide, usually in at least about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 25, 50, 75, 100 or more amino acid residues.

A “variant” of a nucleic acid is a nucleic acid that differs in one or more nucleic acid residues from a parent or reference nucleic acid, usually in at least about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 17, 20, 21, 24, 27, 30, 33, 36, 39, 40, 45, 50, 60, 75, 150, 225, 300 or more nucleic acid residues.

An “antigen” refers to a molecule that is capable of inducing, promoting, enhancing, or modulating an immune response or immune reaction. In some instances, the immune response or immune reaction is a humoral and/or cellular response. An antigen may induce, promote, enhance or modulate an immune response or immune reaction in cells in vitro and/or in vivo in a subject and/or ex vivo in a subject's cells or tissues. Such immune response or reaction may include, but is not limited to, eliciting the formation of antibodies in a subject, or generating a specific population of lymphocytes reactive with the antigen. Antigens are typically macromolecules (e.g., proteins and polysaccharides) that are foreign to the host.

A “subsequence” or “fragment” of nucleic acids or amino acids refers to a sequence of nucleic acids or amino acids, respectively, that comprises any part or segment of a longer sequence of nucleic acids (e.g., polynucleotide) or amino acids (e.g., polypeptide), respectively, up to and including the complete (entire) nucleic acid sequence or complete amino acid sequence.

An “adjuvant” refers to a molecule or substance that augments or enhances an immune response, including, for example, but not limited to, an antigen's immune-stimulating properties or the pharmacological effect(s) of a compound or drug. An adjuvant may non-specifically enhance an immune response, e.g., the immune response to an antigen. “Freund's Complete Adjuvant,” for example, is an emulsion of oil and water containing an immunogen, an emulsifying agent and mycobacteria. Another example, “Freund's incomplete adjuvant,” is the same, but without mycobacteria. An adjuvant may comprise oils, emulsifiers, killed bacteria, aluminum hydroxide, or calcium phosphate (e.g., in gel form), or combinations thereof. An adjuvant may be administered into a subject (e.g., via injection intramuscularly or subcutaneously) in an amount sufficient to produce antibodies.

“Naturally occurring” as applied to an object refers to the fact that the object can be found in nature as distinct from being artificially produced by man. A polypeptide or polynucleotide sequence that is present in an organism (including viruses, bacteria, protozoa, insects, plants or mammalian tissue) that can be isolated from a source in nature and which has not been intentionally modified by man in the laboratory is naturally occurring. Non-naturally occurring as applied to an object means that the object is not naturally-occurring—i.e., the object cannot be found in nature as distinct from being artificially produced by man.

Numbering of a given amino acid polymer or nucleotide polymer “corresponds to numbering” of a selected amino acid polymer or nucleic acid polymer when the position of any given polymer component (e.g., amino acid residue, nucleotide residue) is designated by reference to the same or an equivalent residue position in the selected amino acid or nucleotide polymer, rather than by the actual position of the component in the given polymer. Thus, for example, the numbering of a given amino acid position in a given polypeptide sequence corresponds to the same or equivalent amino acid position in a selected polypeptide sequence used as a reference sequence.

A vector is a component or composition for facilitating cell transduction or transfection by a nucleic acid, or expression of the nucleic acid in the cell. Vectors include, e.g., plasmids, cosmids, viruses, YACs, bacteria, poly-lysine, etc. An “expression vector” or “expression cassette” is a nucleic acid construct or sequence with nucleic acid elements that permit transcription of a nucleic acid in a host cell and/or that are capable of effecting expression of a nucleic acid in a host compatible with such construct or sequence. An expression vector or cassette can be generated recombinantly or synthetically by methods known in the art. The expression vector or cassette can be part of a plasmid, virus, or nucleic acid fragment. The expression vector or expression cassette typically includes a nucleic acid to be transcribed (e.g., a nucleic acid encoding a desired polypeptide) operably linked to a promoter. The nucleic acid to be transcribed is typically under the direction or control of the promoter. An expression vector or cassette optionally includes transcription termination signal(s). Additional factors necessary or helpful in effecting expression may also be used as described herein. For example, an expression vector or cassette can also include nucleotide sequences that encode a signal sequence that directs secretion of an expressed protein from the host cell. Enhancers and other nucleic acid sequences that influence nucleotide expression or gene expression can also be included.

“Substantially the entire length of a polynucleotide sequence” or “substantially the entire length of a polypeptide sequence” refers to at least about 50%, generally at least about 60%, 70%, or 75%, usually at least about 80% or 85%, or typically at least about 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 99.5% or more of a length of a polynucleotide sequence or polypeptide sequence.

The term “pharmaceutical composition” means a composition suitable for pharmaceutical use in a subject, including an animal or human. A pharmaceutical composition generally comprises an effective amount of an active agent and a carrier, including, e.g., a pharmaceutically acceptable carrier.

The term “effective amount” means a dosage or amount of a molecule or composition sufficient to produce a desired result. For example, the desired result may comprise a measurable or testable induction, promotion, enhancement or modulation of an immune response in a subject to whom a dosage or amount of a particular antigen or immunogen (or composition thereof) has been administered. In one aspect, the desired result may comprise a measurable or testable induction, promotion, enhancement of an immune response in a subject to whom a dosage or amount of a particular viral antigen or immunogen (or composition thereof) has been administered sufficient to protect the subject against challenge by a virus. In another aspect, the desired result may comprise an objective or subjective improvement in the subject receiving a dosage or amount of a particular molecule or composition (e.g., the subject to whom the dosage or amount of the particular molecule or composition is administered).

A “prophylactic treatment” is a treatment administered to a subject who does not display and/or suffer from signs or symptoms of a disease, pathology, or medical disorder, or displays and/or suffers from only early signs or symptoms of a disease, pathology, or disorder, such that treatment is administered for the purpose of diminishing, preventing, and/or decreasing the risk of developing the disease, pathology, or medical disorder. A prophylactic treatment functions as a preventative treatment against a disease or disorder. A “prophylactic activity” is an activity of an agent, such as a nucleic acid, vector, gene, polypeptide, protein, molecule, substance, or composition thereof, that when administered to a subject who does not display and/or suffer from signs or symptoms of pathology, disease, or disorder, or who displays and/or suffers from only early signs or symptoms of pathology, disease, or disorder, can diminish, prevent, and/or decrease the risk of the subject developing such pathology, disease, or disorder. A “prophylactically useful” agent or molecule (e.g., nucleic acid or polypeptide) refers to an agent or molecule useful in diminishing, preventing, immunizing against, treating, and/or decreasing development of a pathology, disease, or disorder.

A “therapeutic treatment” is a treatment administered to a subject who displays and/or suffers from symptoms or signs of pathology, disease, or disorder, for the purpose of diminishing, treating, and/or eliminating those signs or symptoms of pathology, disease, or disorder. A “therapeutic activity” is an activity of an agent, such as a nucleic acid, vector, gene, polypeptide, protein, molecule, substance, or composition thereof, that eliminates, treats, and/or diminishes signs or symptoms of pathology, disease, or disorder, when administered to a subject displaying and/or suffering from such signs or symptoms. A “therapeutically useful” agent or molecule (e.g., nucleic acid or polypeptide) is an agent or molecule that is useful in diminishing, treating, and/or eliminating such signs or symptoms of a pathology, disease, or disorder.

The term “gene” broadly refers to any segment of DNA associated with a biological function. Genes include coding sequences and/or regulatory sequences required for their expression. Genes also include non-expressed DNA nucleic acid segments that, e.g., form recognition sequences for other proteins (e.g., promoter, enhancer, or other regulatory regions). Genes can be obtained from a variety of sources, including cloning from a source of interest or synthesizing from known or predicted sequence information, and may include sequences designed to have desired parameters.

Generally, the nomenclature used hereafter and the laboratory procedures in cell culture, molecular genetics, molecular biology, nucleic acid chemistry, and protein chemistry described below are those well known and commonly employed by those of ordinary skill in the art. Standard techniques, such as described in Sambrook et al., Molecular Cloning—A Laboratory Manual (2nd Ed.), Vols. 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y., 1989 (hereinafter “Sambrook”) and Current Protocols in Molecular Biology, F. M. Ausubel et al., eds., Current Protocols, a joint venture between Greene Publishing Associates, Inc. and John Wiley & Sons, Inc. (1994, supplemented through 1999) (hereinafter “Ausubel”), are used for recombinant nucleic acid methods, nucleic acid synthesis, cell culture methods, and transgene incorporation, e.g., electroporation, injection, gene gun, impressing onto or through the skin or tissue of a subject, and lipofection. Generally, oligonucleotide synthesis and purification steps are performed according to specifications. The techniques and procedures are generally performed according to conventional methods in the art and various general references which are provided throughout this document. The procedures therein are believed to be well known to those of ordinary skill in the art and are provided for the convenience of the reader.

As used herein, an “antibody” refers to a protein comprising one or more polypeptides substantially or partially encoded by immunoglobulin genes or fragments of immunoglobulin genes. The term antibody is used to mean whole antibodies and binding fragments thereof. The recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, delta, epsilon and mu constant region genes, as well as myriad immunoglobulin variable region genes. Light chains are classified as either kappa or lambda. Heavy chains are classified as gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, IgG, IgM, IgA, IgD and IgE, respectively. A typical immunoglobulin (e.g., antibody) structural unit comprises a tetramer. Each tetramer is composed of two identical pairs of polypeptide chains, each pair having one “light” (about 25 KDa) and one “heavy” chain (about 50-70 KDa). The N-terminus of each chain defines a variable region of about 100 to 110 or more amino acids primarily responsible for antigen recognition. The terms variable light chain (VL) and variable heavy chain (VH) refer to these light and heavy chains, respectively.

Antibodies exist as intact immunoglobulins or as a number of well characterized fragments produced by digestion with various peptidases. Thus, for example, pepsin digests an antibody below the disulfide linkages in the hinge region to produce F(ab)′2, a dimer of Fab which itself is a light chain joined to VH-CH1 by a disulfide bond. The F(ab)′2 may be reduced under mild conditions to break the disulfide linkage in the hinge region thereby converting the (Fab′)₂ dimer into an Fab′ monomer. The Fab′ monomer is essentially an Fab with part of the hinge region. The Fc portion of the antibody molecule corresponds largely to the constant region of the immunoglobulin heavy chain, and is responsible for the antibody's effector function (see, Fundamental Immunology, W. E. Paul, ed., Raven Press, N.Y. (1993), for a more detailed description of other antibody fragments). While various antibody fragments are defined in terms of the digestion of an intact antibody, one of skill will appreciate that such Fab′ fragments may be synthesized de novo either chemically or by utilizing recombinant DNA methodology. Thus, the term antibody, as used herein also includes antibody fragments either produced by the modification of whole antibodies or synthesized de novo using recombinant DNA methodologies.

Antibodies also include single-armed composite monoclonal antibodies, single chain antibodies, including single chain Fv (sFv) antibodies in which a variable heavy and a variable light chain are joined together (directly or through a peptide linker) to form a continuous polypeptide, as well as diabodies, tribodies, and tetrabodies (Pack et al. (1995) J Mol Biol 246:28; Biotechnol 11: 1271; and Biochemistry 31:1579). The antibodies are, e.g., polyclonal, monoclonal, chimeric, humanized, single chain, Fab fragments, fragments produced by an Fab expression library, or the like.

The term “epitope” generally refers to a peptide or polypeptide determinant capable of specifically binding to an antibody. Epitopes usually consist of chemically active surface groupings of molecules such as amino acids or sugar side chains and usually have specific three-dimensional structural characteristics, as well as specific charge characteristics. Conformational and nonconformational epitopes are distinguished in that the binding to the former but not the latter is lost in the presence of denaturing solvents.

An “antigen-binding fragment” of an antibody is a peptide or polypeptide fragment of the antibody that binds or selectively binds an antigen. An antigen-binding site is formed by those amino acids of the antibody that contribute to, are involved in, or affect the binding of the antigen. See Scott, T. A. and Mercer, E. I., Concise Encyclopedia: Biochemistry and Molecular Biology (de Gruyter, 3d ed. 1997), and Watson, J. D. et al., Recombinant DNA (2d ed. 1992) [hereinafter “Watson, Recombinant DNA”], each of which is incorporated herein by reference in its entirety for all purposes.

The term “screening” describes, in general, a process that identifies optimal or optimized molecules. Several properties of the respective molecules can be used in selection and screening including, for example, ability to induce a desired immune response in a test system. Selection is a form of screening in which identification and physical separation are achieved simultaneously by expression of a selection marker, which, in some genetic circumstances, allows cells expressing the marker to survive while other cells die (or vice versa). Because of limitations in studying primary immune responses in vitro, in vivo studies are particularly useful screening methods. In one aspect, screening refers to a process that identifies a polypeptide (or a nucleic acid encoding such polypeptide), wherein the polypeptide induces or is capable of inducing an immune response to at least a portion of dengue viruses of at least one virus serotype in a subject, or cells of a subject, that is about equal to or greater than the immune response induced or capable of being induced by a reference polypeptide (e.g., wild-type polypeptide).

A “specific binding affinity” between two molecules, e.g., a ligand and a receptor, means a preferential binding of one molecule for another in a mixture of molecules. The binding of the molecules is typically considered specific if the binding affinity is about 1×10² M⁻¹ to about 1×10¹⁰ M⁻¹ (i.e., about 10⁻²-10⁻¹⁰ M) or greater, including about 10⁴ to 10⁶ M⁻¹, about 10⁶ to 10⁷ M⁻¹, or about 10⁸ M⁻¹ to 10⁹ M⁻¹ or 10¹⁰ M⁻¹.

“Avidity” refers to the tendency of an antibody to bind an antigen. The higher the avidity, the greater the affinity of the antibody for the antigen, the greater the binding of the antibody to the antigen, and the greater the stability of the antigen-antibody complex formed by binding of the antibody to the antigen.

The term “immunoassay” includes an assay that uses an antibody or immunogen to bind or specifically bind an antigen. The immunoassay may be characterized by the use of specific binding properties of a particular antibody to isolate, target, and/or quantify the antigen.

The invention provides a recombinant, synthetic, mutant, and/or isolated polypeptide comprising an immunogenic amino acid sequence that is substantially identical (e.g., at least about 75%, 80%, 85%, 86%, 87%, 88% or 89%, preferably at least about 90%, 91%, 92%, 93%, or 94%, and more preferably at least about 95% (e.g., about 87-95%), 96% 97%, 98%, 99%, 99.5% sequence identity) to an amino acid sequence of at least one of SEQ ID NOS:1-49, 65-116, 139-148, 153-155, 236-253, 343, and 345. In addition, the invention provides a recombinant, synthetic, mutant, and/or isolated polynucleotide that encodes an immunogenic amino acid sequence that is substantially identical to a nucleic acid sequence of at least one of SEQ ID NOS:156-218, 235, 254-271, 285-330, 342, and 344. As applied to polypeptides, the term “substantial identity” means that two or more amino acid sequences, when optimally aligned, such as by GAP or BESTFIT programs using default gap weights, by visual inspection, or any other suitable technique such as the sequence analysis and identity algorithms further describe herein, share at least about 60%, typically at least about 65%, usually at least about 70%, often at least about 75%, usually at least about 80%, at least about 85%, about 86%, about 87%, about 88%, or about 89%, and preferably at least about 90%, or more (e.g., at least about 91%, about 92%, about 93%, about 94%, about 95%, about 96%, about 97%, about 97.5%, about 98%, about 98.5%, about 99%, or about 99.5% or more) amino acid sequence identity. Similarly, as applied to nucleic acids, the term substantial identity or substantial similarity means that the two or more nucleic acid sequences, when optimally aligned, such as by the programs BLAST, GAP or BESTFIT using default gap weights (described in detail below along with other suitable programs and techniques for assessing nucleic acid sequence identity levels), or by visual inspection, share at least about 60% nucleic acid sequence identity or sequence similarity, at least about 70% or at least about 75% sequence identity or sequence similarity, more desirably at least about 80 or about 85% nucleic acid sequence identity or sequence similarity; preferably at least about 90% nucleic acid sequence identity or sequence similarity, and more preferably at least about 95% nucleic acid sequence identity or sequence similarity (including, e.g., about 90, 91, 92, 93, 94, 95, 96, 97, 98, 98.5, 99, 99.5, or more percent nucleotide sequence identity or sequence similarity).

“Identity” (sometimes referred to as “overall identity”—in contrast to “local identity,” which is discussed further herein) with respect to amino acid or nucleotide sequences refers to the percentage of amino acid residues or nucleotide bases, respectively, that are identical in the two amino acid or nucleotide sequences when two such amino acid sequences or two such nucleotide sequences are optimally aligned with one another. If, in the optimal alignment, a position in a first sequence is occupied by the same amino acid residue or nucleotide residue as the corresponding position in the second corresponding amino acid or nucleotide sequence, the sequences exhibit identity with respect to that residue position. The level of identity between two sequences (or “percent sequence identity”) is measured as a ratio of the number of identical positions shared by the sequences with respect to the size of the sequences analyzed (i.e., percent sequence identity=(number of identical positions/total number of positions)×100).

The “optimal alignment” is the alignment that provides the highest identity between the aligned sequences. In obtaining the optimal alignment, gaps can be introduced, and some amount of non-identical sequences and/or ambiguous sequences can be ignored. Preferably, if a gap needs to be inserted into a first sequence to achieve the optimal alignment, the percent identity is calculated using only the residues that are paired with a corresponding amino acid residue (i.e., the calculation does not consider residues in the second sequences that are in the “gap” of the first sequence). However, it is often preferable that the introduction of gaps and/or the ignoring of non-homologous/ambiguous sequences are associated with a “gap penalty.”

While identity between relatively short amino acid or nucleic acid sequences can be easily determined by visual inspection, analysis with an appropriate algorithm, typically facilitated through computer software, commonly is used to determine identity between longer sequences. When using a sequence comparison algorithm, test and reference sequences typically are input into a computer, subsequence coordinates are designated, if necessary, and sequence algorithm program parameters are designated. The sequence comparison algorithm then calculates the percent sequence identity for the test sequence(s) relative to the reference sequence, based on the designated program parameters. A number of mathematical algorithms for rapidly obtaining the optimal alignment and calculating identity between two or more sequences are known and incorporated into a number of available software programs. Examples of such programs include the MATCH-BOX, MULTAIN, GCG, FASTA, and ROBUST programs for amino acid sequence analysis, and the SIM, GAP, NAP, LAP2, GAP2, and PIPMAKER programs for nucleotide sequences. Preferred software analysis programs for both amino acid and polynucleotide sequence analysis include the ALIGN, CLUSTALW (e.g., version 1.6 and later versions thereof), and BLAST programs (e.g., BLAST 2.1, BL2SEQ, and later versions thereof). Select examples of which are further described in the following paragraphs.

For amino acid sequence analysis, a weight matrix, such as the BLOSUM matrixes (e.g., the BLOSUM45, BLOSUM50, BLOSUM62, and BLOSUM80 matrixes—as described in, e.g., Henikoff and Henikoff (1992) Proc Natl Acad Sci USA 89:10915-10919), Gonnet matrixes (e.g., the Gonnet40, Gonnet80, Gonnet120, Gonnet160, Gonnet250, and Gonnet350 matrixes), or PAM matrixes (e.g., the PAM30, PAM70, PAM120, PAM160, PAM250, and PAM350 matrixes), are used in determining identity. BLOSUM matrixes are preferred. The BLOSUM50 and BLOSUM62 matrixes are typically most preferred. In the absence of availability of such weight matrixes (e.g., in nucleic acid sequence analysis and with some amino acid analysis programs), a scoring pattern for residue/nucleotide matches and mismatches can be used (e.g., a +5 for a match and −4 for a mismatch pattern).

The ALIGN program produces an optimal global (overall) alignment of the two chosen protein or nucleic acid sequences using a modification of the dynamic programming algorithm described by Myers and Miller (1988) CABIOS 4:11-17. Preferably, if available, the ALIGN program is used with weighted end-gaps. If gap opening and gap extension penalties are available, they are preferably set between about −5 to −15 and 0 to −3, respectively, more preferably about −12 and −0.5 to −2, respectively, for amino acid sequence alignments, and −10 to −20 and −3 to −5, respectively, more preferably about −16 and −4, respectively, for nucleic acid sequence alignments. The ALIGN program and principles underlying it are further described in, e.g., Pearson et al. (1988) Proc Natl Acad Sci USA 85:2444-48, and Pearson et al. (1990) Methods Enzymol 18:63-98.

Alternatively, and particularly for multiple sequence analysis (i.e., comparison of more than three sequences), the CLUSTALW program (described in, e.g., Thompson, J. D. et al. (1994) Nuc Acids Res 22:4673-4680) can be used. In one aspect, Gap open and Gap extension penalties are set at 10 and 0.05, respectively. Alternatively or additionally, the CLUSTALW program is run using “dynamic” (versus “fast”) settings. Preferably, nucleotide sequence analysis with CLUSTALW is performed using the BESTFIT matrix, whereas amino acid sequences are evaluated using a variable set of BLOSUM matrixes depending on the level of identity between the sequences (e.g., as used by the CLUSTALW version 1.6 program available through the San Diego Supercomputer Center (SDSC)). Preferably, the CLUSTALW settings are set to the SDSC CLUSTALW default settings (e.g., with respect to special hydrophilic gap penalties in amino acid sequence analysis). The CLUSTALW program and underlying principles of operation are further described in, e.g., Higgins et al. CABIOS, (1992) 8(2): 189-91, Thompson et al. (1994) Nucleic Acids Res 22:4673-80, and Jeanmougin et al. (1998) Trends Biochem Sci 2:403-07.

Another useful algorithm for determining percent identity is the FASTA algorithm, which is described in Pearson, W. R. & Lipman, D. J. (1988) Proc Natl Acad Sci USA 85:2444. See also, W. R. Pearson (1996) Methods Enzymol 266:227-258. Preferred parameters used in a FASTA alignment of DNA sequences to calculate percent identity are optimized, BL50 Matrix 15: −5, k-tuple=2; joining penalty=40, optimization=28; gap penalty=12, gap length penalty=−2; and width=16.

Other preferred algorithms include the BLAST and BLAST 2.0 algorithms, which facilitate analysis of at least two amino acid or nucleotide sequences, by aligning a selected sequence against multiple sequences in a database (e.g., GenSeq), or, when modified by an additional algorithm such as BL2SEQ, between two selected sequences. Software for performing BLAST analyses is publicly available through the National Center for Biotechnology Information (NCBI) (“http” colon, slash, slash, “www” dot “ncbi” dot “nlm” dot “nih” dot “gov” slash). The BLAST algorithm parameters W, T, and X determine the sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) can be used with a word length (W) of 11, an expectation (E) of 10, M=5, N=−4 and a comparison of both strands. For amino acid sequences, the BLASTP program (e.g., BLASTP 2.0.14; Jun, 29, 2000) can be used with a word length of 3 and an expectation (E) of 10.

BLAST program analysis also or alternatively is preferably modified by low complexity filtering programs such as the DUST or SEG programs, which are preferably integrated into the BLAST program operations (see, e.g., Wootton et al. (1993) Comput Chem 17:149-63, Altschul et al. (1991) Nat Genet 6:119-29, Hancock et al. (1991) Comput Appl Biosci 10:67-70, and Wootton et al. (1996) Meth Enzymol 266:554-71). In such aspects, if a lambda ratio is used, preferred settings for the ratio are between 0.75 and 0.95, more preferably between 0.8 and 0.9. If gap existence costs (or gap scores) are used in such aspects, the gap existence cost preferably is set between about −5 and −15, more preferably about −10, and the per residue gap cost preferably is set between about 0 to −5, more preferably between 0 and −3 (e.g., −0.5). Similar gap parameters can be used with other programs as appropriate. The BLAST programs and principles underlying them are further described in, e.g., Altschul et al. (1990) J Mol Biol 215:403-10, Karlin and Altschul (1990) Proc Natl Acad Sci USA, 87:2264-68 (as modified by Karlin and Altschul (1993) Proc Natl Acad Sci USA 90:5873-77), and Altschul et al. (1997) Nucl Acids Res 25:3389-3402.

Another example of a useful algorithm is incorporated in PILEUP software. The PILEUP program creates a multiple sequence alignment from a group of related sequences using progressive, pair-wise alignments to show relationship and percent sequence identity or percent sequence similarity. PILEUP uses a simplification of the progressive alignment method of Feng & Doolittle (1987) J Mol Evol 35:351-360, which is similar to the method described by Higgins & Sharp (1989) CABIOS 5:151-153. Preferred parameters for the PILEUP program are: default gap weight (3.00), default gap length weight (0.10), and weighted end gaps. PILEUP is a component of the GCG sequence analysis software package, e.g., version 7.0 (see, e.g., Devereaux et al. (1984) Nuc Acids Res 12:387-395).

Other useful algorithms for performing identity analysis include the local homology algorithm of Smith and Waterman (1981) Adv Appl Math 2:482, the homology alignment algorithm of Needleman and Wunsch (1970) J Mol Biol 48:443, and the search for similarity method of Pearson and Lipman (1988) Proc Natl Acad Sci USA 85:2444. Computerized implementations of these algorithms (e.g., GAP, BESTFIT, and TFASTA) are provided in the Wisconsin Genetics Software Package Release 7.0, Genetics Computer Group, 575 Science Dr., Madison, Wis.

Several additional commercially available software suites incorporate the ALIGN, BLAST, and CLUSTALW programs and similar functions, and may include significant improvements in settings and analysis. Examples of such programs include GCG suite of programs and those available through DNASTAR, Inc. (Madison, Wis.), such as Lasergene® and Protean® programs. A preferred alignment method is the Jotun Hein method, incorporated within the MegaLine™ DNASTAR package (MegaLine™ Version 4.03) used according to the manufacturer's instructions and default values specified in the program.

Because various algorithms, matrixes, and programs are commonly used to analyze sequences, amino acid and polynucleotide sequences are preferably characterized in terms of approximate identities by indicating a range of identity “about” a particular identity, (e.g., +/−10%, more preferably +/−8%, and even more preferably +/−5% of the particular identity). Nonetheless, an exact identity can be measured by using only one of the aforementioned programs, such as a BLAST program described herein or the Hein method.

In one aspect, the invention provides a recombinant, synthetic, and/or isolated polypeptide (which may be simply referred to as the polypeptide or recombinant polypeptide) which comprises an amino acid sequence that has at least about 90% amino acid sequence identity to an amino acid sequence of at least one of SEQ ID NOS:1-49, 65-116, 139-148, 153-155, 236-253, 343, and 345. Desirably, the recombinant polypeptide comprises a sequence that has at least about 90% amino acid sequence identity to at least two sequences selected from the group of SEQ ID NOS:1-49, 65-116, 139-148, 153-155, 236-253, 343, and 345. Favorably, the recombinant polypeptide comprises a sequence that has at least about 90% amino acid sequence identity to at least five sequences selected from the group of SEQ ID NOS:1-49, 65-116, 139-148, 153-155, 236-253, 343, and 345. Advantageously, the recombinant polypeptide comprises a sequence that has at least about 90% amino acid sequence identity to at least about ten, preferably at least about fifteen, and more preferably at least about twenty sequences selected from the group of SEQ ID NOS:1-49, 65-116, 139-148, 153-155, 236-253, 343, and 345. Preferably, the recombinant polypeptide comprises an amino acid sequence that has at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99% identity to at least one, preferably at least five, and more preferably at least about ten sequences selected from any of SEQ ID NOS:1-49, 65-116, 139-148, 153-155, 236-253, 343, and 345.

A recombinant polypeptide that comprises an amino acid selected from the group of SEQ ID NOS:1-49, 65-116, 139-148, 153-155, 236-253, 343, and 345 is preferred. The polypeptide can comprise any number of suitable additional amino acid sequences, such as, e.g., additional sequences described elsewhere herein (e.g., signal sequence and/or purification-facilitating epitope tag (e.g., Whitehorn et al., Biotechnology 13:1215-19 (1995)), or the polypeptide can consist essentially entirely of an amino acid sequence selected from the group of SEQ ID NOS:1-49, 65-116, 139-148, 153-155, 236-253, 343, and 345.

Alternatively, but more typically additionally, the recombinant polypeptide can comprise an amino acid sequence that has substantial functional homology to the immunogenic amino acid sequence of any polypeptide of the invention, such as the amino acid sequence of any one of SEQ ID NOS:1-49, 65-116, 139-148, 153-155, 236-253, 343, and 345. “Substantial functional homology” means that the analyzed amino acid sequences share at least about 60%, typically at least about 65%, usually at least about 70%, often at least about 75%, preferably at least about 80%, more preferably at least about 85%, and even more preferably at least about 90%, or more (e.g., 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 99.5% or more) functionally homologous residues in the optimal homology alignment. The “optimal functional homology alignment” is the alignment that provides the highest level of homology between two amino acid sequences, using the principles described above with respect to the “optimal alignment.” Conservative amino acid residue substitutions involve exchanging a member within one class of amino acid residues for a residue that belongs to the same class (identical amino acid residues are considered functionally homologous or conserved in calculating percent functional homology). The classes of amino acids and the members of those classes are presented in Table 1.

TABLE 1 Amino Acid Residue Classes Amino Acid Class Amino Acid Residues Acidic Residues ASP and GLU Basic Residues LYS, ARG, and HIS Hydrophilic Uncharged Residues SER, THR, ASN, and GLN Aliphatic Uncharged Residues GLY, ALA, VAL, LEU, and ILE Non-polar Uncharged Residues CYS, MET, and PRO Aromatic Residues PHE, TYR, and TRP

An alternative set of conservative amino acid substitutions, delineated by six conservation groups, is provided in Table 2.

TABLE 2 Alternative Amino Acid Residue Substitution Groups 1 Alanine (A) Serine (S) Threonine (T) 2 Aspartic acid (D) Glutamic acid (E) 3 Asparagine (N) Glutamine (Q) 4 Arginine (R) Lysine (K) 5 Isoleucine (I) Leucine (L) Methionine (M) 6 Phenylalanine (F) Tyrosine (Y) Tryptophan (W)

More conservative substitutions exist within the above-described classes and can be alternatively preferred. An example of conservation groups for more conservative substitutions include: valine-leucine-isoleucine, phenylalanine-tyrosine, lysine-arginine, alanine-valine, and asparagine-glutamine. Additional groups of amino acids can also be formulated using the principles described in, e.g., Creighton (1984) Proteins: Structure and Molecular Properties (2d Ed. 1993), W.H. Freeman and Company.

Typically, one or more amino acid residues in the immunogenic amino acid sequence of the polypeptide that are not identical to a corresponding residue in at least one of the immunogenic amino acid sequences disclosed herein, such as, e.g., SEQ ID NOS:1-49 and 153-155, usually differ from the most related immunogenic amino acid sequence (e.g., the most related sequence selected from the group of SEQ ID NOS:1-49 and 153-155) by conservative amino acid substitutions, i.e., substitutions with one of more of the groups provided in Table 1 or Table 2 above, or, typically, substitutions that are within a single group in each such table). As such, the disclosure of a polypeptide or protein sequence herein, in conjunction with the above-described conservation groups, provides an express listing of all conservatively substituted polypeptide sequences relating to these sequences.

Typically, the immunogenic amino acid sequence of the polypeptide also exhibits substantial weight homology to one of the immunogenic amino acid sequences of the invention, commonly to at least one of the group of SEQ ID NOS:1-49, 65-116, 139-148, 153-155, 236-253, 343, and 345. Desirably, the immunogenic amino acid sequence has substantially high weight homology with at least 5, preferably at least about 10, or more, of the disclosed immunogenic amino acid sequences of the invention (e.g., about 5, 10 or more sequences selected from any of the above-referenced group of sequences). “Substantial weight homology” means that at least about 60%, preferably at least about 70%, and more preferably at least about 80% (e.g., about 65-85%), or more (e.g., about 87%, 90%, 92%, 95%, or 99%) of the non-identical amino acid residues at a position in the polypeptide are members of the same weight-based “weak conservation group” or “strong conservation group” as the corresponding amino acid in the most identical or functionally homologous sequence among the disclosed immunogenic amino acid sequences of the invention, such as an amino acid sequence selected from the group of and SEQ ID NOS:1-49, 65-116, 139-148, 153-155, 236-253, 343, and 345. Strong group conservation is preferred. Weight-based conservation is determined on the basis of whether the non-identical corresponding amino acid is associated with a positive score on one of the weight-based matrices described herein (e.g., the BLOSUM50 matrix and preferably the PAM250 matrix). Weight-based strong conservation groups include Ser Thr Ala, Asn Glu Gln Lys, Asn His Gln Lys, Asn Asp Glu Gin, Gln His Arg Lys, Met Ile Leu Val, Met Ile Leu Phe, His Tyr, and Phe Tyr Trp. Weight-based weak conservation groups include Cys Ser Ala, Ala Thr Val, Ser Ala Gly, Ser Thr Asn Lys, Ser Thr Pro Ala, Ser Gly Asn Asp, Ser Asn Asp Glu Gln Lys, Asn Asp Glu Gln His Lys, Asn Glu Gln His Arg Lys, Phe Val Leu Ile Met, and His Phe Tyr. The CLUSTALW sequence analysis program provides analysis of weight-based strong conservation and weak conservation groups in its output, and offers the preferred technique for determining weight-based conservation, preferably using the CLUSTALW default settings used by the SDSC.

Alternatively, but typically in addition to either substantial identity or substantial functional homology, the polypeptide comprises an immunogenic amino acid sequence that shares a similar hydropathy profile (or exhibits similar hydrophilicity) to at least one (preferably at least 5, and more preferably at least about 10) of the immunogenic amino acid sequences disclosed herein, such as amino acid sequences selected from the group of SEQ ID NOS:1-49. A hydropathy profile can be determined using the Kyte & Doolittle index, the scores for each naturally occurring amino acid in the index being as follows: I (+4.5), V (+4.2), L (+3.8), F (+2.8), C (+2.5), M (+1.9); A (+1.8), G (−0.4), T (−0.7), S (−0.8), W (−0.9), Y (−1.3), P (−1.6), H (−3.2); E (−3.5), Q (−3.5), D (−3.5), N (−3.5), K (−3.9), and R (−4.5) (see, e.g., U.S. Pat. No. 4,554,101 and Kyte & Doolittle, (1982) J Molec Biol 157:105-32 for further discussion). Preferably, at least about 45%, preferably at least about 60%, and more preferably at least about 75% (e.g., at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97% at least about 98%, or at least about 99%) of the amino acid residues in the immunogenic amino acid sequence that are not identical to the corresponding residues in the most identical or functionally homologous immunogenic amino acid sequence disclosed herein (“most related homolog”), which homolog is preferably selected from any of SEQ ID NOS:1-49, exhibit less than a +/−2 change in hydrophilicity, more preferably less than a +/−1 change in hydrophilicity, and even more preferably less than a +/−0.5 change in hydrophilicity with respect to the non-identical amino acid residue at the corresponding position in the most related homolog. Overall, the polypeptide desirably exhibits a total change in hydrophilicity, with respect to its most related homolog selected from the group of SEQ ID NOS:1-49, 65-116, 139-148, 153-155, and 236-253, of less than about 150, more preferably less than about 100, and even more preferably less than about 50 (e.g., less than about 30, less than about 20, or less than about 10). Examples of typical amino acid substitutions that retain similar or identical hydrophilicity include arginine-lysine substitutions, glutamate-aspartate substitutions, serine-threonine substitutions, glutamine-asparagine substitutions, and valine-leucine-isoleucine substitutions. Algorithms and software, such as the GREASE program available through the SDSC, provide a convenient way for quickly assessing the hydropathy profile of a peptide fragment or peptide portion.

The polypeptide desirably comprises a substantially identical (e.g., having at least about 75%, 80%, 85%, 86%, 87%, 88% or 89%, preferably at least about 90%, 91%, 92%, 93%, or 94%, and more preferably at least about 95% (e.g., about 87-95%), 96% 97%, 98%, 99%, 99.5% sequence identity), or at least substantially functionally homologous, immunogenic amino acid sequence to at least one sequence (preferably at least 5 sequences, and more preferably at least about 10 sequences) selected from the group of SEQ ID NOS:1-49, 65-116, 139-148, 153-155, and 236-253, wherein at least about 90%, preferably at least about 95%, and more preferably 100% of the amino acid residues in the composition have a Kyte & Doolittle hydropathy score of above 0, and more preferably of at least about 1.

Recombinant Truncated E Protein and Full Length E Protein Polypeptides

The invention also provides a recombinant polypeptide comprising an amino acid sequence that has that has at least about 75%, 80%, 85%, 86%, 87%, 88% or 89%, preferably at least about 90%, 91%, 92%, 93%, or 94%, and more preferably at least about 95% (e.g., about 87-95%), 96% 97%, 98%, 99%, 99.5%, or more amino acid sequence identity to an amino acid sequence of at least one of SEQ ID NOS:1-49 and 153-155. Such a polypeptide is usually referred to as a recombinant truncated envelope (E) protein polypeptide (or “truncated E” or “tE” polypeptide) as described above. The invention also provides recombinant E polypeptides of the invention that have a sequence length equivalent to or substantially equivalent to (e.g., within about 85%, 87%, 88%, 90%, 92% or more of) the length of the amino acid sequence of an envelope protein of a wild-type flavivirus, e.g., preferably a dengue virus envelope protein. Such a polypeptide is usually referred to as a recombinant full length E protein polypeptide (“full length E,” “full E” or “E” polypeptide).

In another aspect, the invention provides a polypeptide that comprises an amino acid sequence that is substantially identical to at least one of SEQ ID NOS:2, 3, 5, 25, 29, and 44-46. Such a polypeptide comprises an amino acid sequence that has at least about 75%, 80%, 85%, 86%, 87%, 88% or 89%, preferably at least about 90%, 91%, 92%, 93%, or 94%, and more preferably at least about 95% (e.g., about 87-95%), 96% 97%, 98%, 99%, 99.5%, or more amino acid sequence identity with at least one of SEQ ID NOS:2, 3, 5, 25, 29, and 44-46. Desirably, the polypeptide comprises an amino acid sequence that has at least about 85%, at least about 90%, at least about 95%, or more amino acid sequence identity with 3, 5, or more sequences selected from any of SEQ ID NOS:2, 3, 5, 25, 29, and 44-46. In one preferred aspect, the polypeptide can comprise, consist essentially of, or consist entirely of an amino acid sequence according of any one of SEQ ID NOS:2, 3, 5, 25, 29, and 44-46.

In a particular aspect, the invention provides a polypeptide that comprises an immunogenic amino acid sequence that is substantially identical (e.g., having at least about 75%, 80%, 85%, 86%, 87%, 88% or 89%, preferably at least about 90%, 91%, 92%, 93%, or 94%, and more preferably at least about 95% (e.g., about 87-95%), 96% 97%, 98%, 99%, 99.5% sequence identity) to SEQ ID NOS:2, 5 or 25. Desirably, the amino acid sequence has at least about 85%, more preferably at least about 90%, and even more preferably at least about 95%, or more (e.g., 97%, 98%, or 99.5%) sequence identity with SEQ ID NO:5. The invention includes a polypeptides comprising SEQ ID NO:2, 5, or 25.

Some such recombinant truncated E polypeptides or recombinant full length E polypeptides of the invention polypeptide induce, promote, or enhance an immune response in a subject (e.g., mammal), or population of cells of a subject, against at least one dengue virus of at least one serotype selected from the group of dengue-1, dengue-2, dengue-3, and dengue-4. Some such polypeptides induce an immune response in a subject against at least one dengue virus of each of at least two, three, or four serotypes selected from the group of dengue-1, dengue-2, dengue-3, and dengue-4.

Some such recombinant truncated E or full length E polypeptides induce an immune response in a subject at least one dengue virus of at least one serotype selected from the group of dengue-1, dengue-2, dengue-3, and dengue-4 that is about equal to or greater than an immune response induced in the subject against the at least one dengue virus of the at least one serotype by a WT truncated E protein of each said at least one dengue virus, wherein said WT truncated E protein has an amino acid sequence length substantially equivalent to that of the recombinant or synthetic polypeptide. The WT truncated E proteins of dengue-1, dengue-2, dengue-3, and dengue-4 can comprise the amino acid sequences consisting essentially of SEQ ID NO:338, 339, 340, and 341, respectively.

Some such polypeptides induce an immune response in a subject, or population of cells of the subject, against at least one dengue virus of each of at least two or three serotypes that is about equal to or greater than an immune response induced in the subject or cells by a WT truncated E protein of at least one dengue virus of each of the at least two or three serotypes, respectively, against at least one dengue virus of each of said at least three serotypes.

Preferably, a recombinant truncated E polypeptide induces an immune response in the subject or cells thereof against at least one dengue virus of each of the four serotypes that is about equal to or greater than an immune response induced in the subjects or its cells by any of SEQ ID NOS:338-341 against at least one dengue virus of each of the four serotypes.

Some such recombinant truncated E and full length E polypeptides induce production of one or more antibodies that bind to at least one dengue virus of at least one dengue virus serotype. Preferably, such a polypeptide induce production of one or more antibodies that bind to at least one dengue virus of each of at least two, more preferably three, or even more preferably four serotypes. In one particular aspect, a recombinant truncated E or full length E polypeptide induces production of a number of antibodies that bind to at least one dengue virus of at least one, two, three, or serotypes that is about equal to or greater than the number of antibodies induced by a wild-type truncated E protein or full length E protein of the at least one dengue virus of the at least one serotype, respectively. In a preferred aspect, a recombinant truncated E or full length E polypeptide induces production of a number of antibodies that bind to at least one dengue virus of each of at least one, preferably at least two, more preferably at least three, and even more preferably at least four serotypes that is about equal to or greater than the number of antibodies induced by a wild-type truncated E protein or full length E protein of the at least one dengue virus of each of the at least one, two, three or four serotypes, respectively.

Some such recombinant truncated E and full length E polypeptides induce the production of one or more antibodies that bind more specifically to at least one dengue virus of the at least one serotype than is induced by a wild-type truncated E or full length E protein, respectively, of the at least one dengue virus of the at least one serotype.

In another aspect, some such recombinant truncated E or full length E polypeptide of the invention induce or produce a titer of neutralizing antibodies against at least one dengue virus of each of at least one, preferably at least two, more preferably at least three, and even more preferably at least four dengue virus serotypes. Some such polypeptides induce or produce a titer of neutralizing antibodies against at least one dengue virus of at least one serotype that is about equal to or greater than a titer of neutralizing antibodies produced against the at least one dengue virus of the at least one serotype by a wild-type truncated E protein of the at least one dengue virus of the at least one serotype, wherein each said wild-type truncated E protein is selected from the group of SEQ ID NOS:338-341. Some such polypeptides induce or produce a titer of neutralizing antibodies against at least one dengue virus of each of at least two, at least three, or at least four serotypes that is about equal to or greater than a titer of neutralizing antibodies produced against the at least one dengue virus of each of the at least two, at least three, or at least four serotypes by a wild-type truncated E protein of the at least one dengue virus of each of the at least two, at least three, or at least four serotypes, respectively, wherein each said wild-type truncated E protein is selected from the group of SEQ ID NOS:338-341.

The invention provides recombinant truncated E polypeptides and recombinant full length E proteins having any of the aforementioned characteristics and characteristics described herein (or any combination of such characteristics), in addition to polypeptides having the additional or alternative characteristics attendant polypeptides of the invention as described further herein.

A polypeptide of the invention can comprise an immunogenic amino acid sequence of any length (e.g., about 10-1500 amino acids, more typically about 50-1000 amino acids, and even more frequently about 100-800 amino acids). Typically, the immunogenic amino acid sequence of the invention is at least about 100, more typically at least about 150, frequently about 200, more frequently about 250, usually at least about 300, more usually at least about 350, and even more usually (and typically preferably) at least about 400 amino acids in length (e.g., about 400-750 amino acids in length, more typically about 425-685 amino acids in length for recombinant truncated E protein polypeptides, and commonly about 425 to about 450 amino acids in length for recombinant truncated E protein polypeptides of the invention, about 435 to about 460 or about 465 for recombinant PRM15/truncated E polypeptides of the invention, and about 650 to about 680 or about 700 amino acids in length for C15/full prM/full E polypeptides of the invention, as are further discussed above and below).

Signal Peptide Sequences

Recombinant truncated E polypeptides and recombinant full length E polypeptides of the invention can also or alternatively comprise any suitable number and type of additional amino acid sequences, such as one or more peptide fragments. In one embodiment, for example, such truncated E or full length E polypeptide further comprises a signal peptide. Generally, the signal peptide directs the recombinant or synthetic polypeptide to the endoplasmic reticulum when the recombinant or synthetic polypeptide is expressed in an animal cell. The inclusion of a signal sequence, which typically directs organelle trafficking and/or secretion of at least a portion of the polypeptide upon expression in a cell is particularly preferred. Such sequences are typically present in the immature (i.e., not fully processed) form of the polypeptide, and are subsequently removed/degraded by cellular proteases to arrive at the mature form of the protein. For example, the truncated E or full length E polypeptide can include any suitable signal sequence or combinations of signal sequences that direct the polypeptide to intracellular compartments, such as a sequence that directs the polypeptide to be transported (e.g., translocated) into (preferably such that the protein is processed by and released from) the endoplasmic reticulum or secretory pathway (e.g., the ER, golgi, and other secretory related organelles and cellular compartments), the nucleus, and/or which directs the polypeptide to be secreted from the cell, translocated in a cellular membrane, or target a second cell apart from the cell the protein is secreted from. In this respect, the polypeptide can include an intracellular targeting sequence (or “sorting signal”) that directs the polypeptide to an endosomal and/or lysosomal compartment(s) or other compartment rich in MHC II to promote CD4+ and/or CD8+ T cell presentation and response, such as a lysosomal/endosomal-targeting sorting signal derived from lysosomal associated membrane protein 1 (e.g., LAMP-1—see, e.g., Wu et al. Proc. Natl. Acad. Sci. USA 92:1161-75 (1995) and Ravipraskash et al., Virology 290:74-82 (2001)), a portion or homolog thereof (see, e.g., U.S. Pat. No. 5,633,234), or other suitable lysosomal, endosomal, and/or ER targeting sequence (see, e.g., U.S. Pat. No. 6,248,565). In some aspects, it may desirable for the intracellular targeting sequence to be located near or adjacent to a proven/identified anti-dengue virus T-cell epitope sequence(s) within the polypeptide, which can be identified by techniques known in the art and described herein, thereby increasing the likelihood of T cell presentation of polypeptide fragments that comprise such epitope(s). Preferably, such polypeptides are expressed from recombinant, synthetic, mutant and/or isolated DNA or RNA delivered to a host cell by one or more of the nucleotide or viral nucleotide transfer vectors, including, e.g., one or more of the gene transfer vectors, described further herein.

Preferably, the polypeptide comprises a signal sequence that directs the polypeptide to the endoplasmic reticulum (ER) (e.g., facilitates ER translocation of the polypeptide) when the polypeptide is expressed in a mammalian cell. The polypeptide can comprise any suitable ER-targeting sequence. Many ER-targeting sequences are known in the art. Examples of such signal sequences are described in U.S. Pat. No. 5,846,540. Commonly employed ER/secretion signal sequences include the STII or Ipp signal sequences of E. coli, yeast alpha factor signal sequence, and mammalian viral signal sequences such as herpes virus gD signal sequence. Further examples of signal sequences are described in, e.g., U.S. Pat. Nos. 4,690,898, 5,284,768, 5,580,758, 5,652,139, and 5,932,445. Suitable signal sequences can be identified using skill known in the art. For example, the SignalP program (described in, e.g., Nielsen et al. (1997) Protein Engineering 10:1-6), which is publicly available through the Center for Biological Sequence Analysis at “http” colon, slash, slash, “www” dot “cbs” dot “dtu” dot “dk” slash “services” slash “SignalP”, or similar sequence analysis software capable of identifying signal-sequence-like domains can be used. Related techniques for identifying suitable signal peptides are provided in Nielsen et al., Protein Eng. 10(1): 1-6 (1997). Sequences can be manually analyzed for features commonly associated with signal sequences, as described in, e.g., European Patent Application 0 621 337, Zheng and Nicchitta (1999) J Biol Chem 274(51): 36623-30, and Ng et al. (1996) J Cell Biol 134(2):269-78.

Recombinant truncated polypeptides having the above-described characteristics typically comprise an immunogenic amino acid sequence that is shorter in amino acid length than the dengue virus envelope protein; that is, the immunogenic amino acid sequence comprises one or more residues less than the total number of residues of a dengue virus envelope protein. Particularly, the immunogenic amino acid of such proteins is typically about 65-95%, and more typically about 80-90%, of the size of a dengue virus envelope protein (determined by number of residues in the respective proteins), preferably in combination with an ER-targeting signal sequence that usually has a size equal to about 5-20% of a dengue virus prM sequence (useful fragments of such amino acid sequences also provided by the invention are discussed further herein). Such truncated E polypeptides often lack at least a portion of the dengue virus E protein C-terminal transmembrane sequence or a functional and/or structural homolog thereof. Proteins having such characteristics can exhibit different secretion qualities than a protein that comprise such a sequence.

The invention also provides recombinant polypeptides comprising an immunogenic amino acid sequence that is equivalent to or similar in length to a complete or full length flavivirus envelope protein, preferably a dengue virus envelope protein. Such amino acid sequences can be referred to as “partial full length” and “full length” E polypeptide sequences, respectively, in contrast to the above-described “truncated” E sequences that comprise an immunogenic amino acid sequence that is equivalent to or similar in length to a truncated envelope protein of a flavivirus, such as a dengue virus. In addition, as described below, the invention also provides polypeptides comprising such a full length immunogenic sequence that further includes a signal peptide sequence.

In one embodiment, a recombinant truncated E polypeptide or full length E polypeptide of the invention further comprises a signal peptide sequence, wherein the signal peptide sequence comprises, or consists essentially of, an amino acid sequence of at least about 10 (e.g., about 8-20) amino acid residues in length that has at least about 50% (preferably at least about 60%, 65%, 70%, 80%, 85%, 90%, 95%, 98%, or more) and even more preferably at least about 85-95% amino acid sequence identity to the C-terminal 5-20% of a flavivirus prM protein sequence. For example, in one aspect, such signal peptide comprises the last 15 amino acid residues of the C terminus of the flavivirus prM protein sequence (e.g., dengue prM protein sequence). Any suitable flavivirus C-terminal prM sequence can be used as the basis for a signal peptide. The flaviviruses are discussed in, e.g., FIELDS VIROLOGY, supra, VIROLOGY, B. N. Fields et al., eds., Raven Press, Ltd., New York (3rd ed., 1996 and 4th ed., 2001) and the ENCYCLOPEDIA OF VIROLOGY, R. G. Webster et al., eds., Academic Press (2nd ed., 1999). Several flaviviral prM sequences also are known (see, e.g., Despres et al. (1990) Virus Res 16(1):59-75, Venugopal et al. (1995) Vaccine August; 13(11):1000-5, International Patent Application WO 01/39802, and GenBank Accession Nos. AAK97602, AAD28623, GNWVTB, BAA23792, BAA23784, BAA08221, BAA08220 and AAF34187). The C-terminal portions of these and other flavivirus prM signal sequences or substantially identical homologs (e.g., having at least about 75%, 80%, 85%, 86%, 87%, 88% or 89%, preferably at least about 90%, 91%, 92%, 93%, or 94%, and more preferably at least about 95% (e.g., about 87-95%), 96% 97%, 98%, 99%, 99.5% sequence identity) thereof can be generated by standard DNA synthesis techniques (homologs can be generated through directed mutagenesis, recursive sequence recombination, rational sequence design, or any other suitable technique, examples of which are discussed further herein) and fused to sequences encoding the amino acid sequence (e.g., a sequence encoding the C-terminal-most 10-20 amino acids of a yellow fever virus, or homolog thereof, can be fused to a sequence encoding any one of SEQ ID NOS:1-49 and 153-155). Introduction of a start codon to the 5′ end of such a prM sequence typically adds an N-terminal methionine to the amino acid sequence when expressed in a mammalian cell (other modifications may occur in bacterial and/or other eukaryotic cells, such as introduction of an formyl-methionine residue at a start codon). The inventors contemplate the production and use of such N-terminal methionine sequences in most aspects where the polypeptide comprises or consists essentially of the amino acid sequence or at least the N-terminus thereof. Standard nucleic acid synthesis techniques are known in the art (see, e.g., Beaucage and Caruthers, Tetrahedron Let 22:1859-1869 (1981), Mathers et al., EMBO J. 3:801-805 (1984), Saiki et al., Science 239:487-491 (1988) U.S. Pat. No. 4,683,202, and other references cited herein).

In one aspect, the recombinant truncated E polypeptide or full length E polypeptide further comprises a signal peptide, which signal peptide comprises a signal sequence of about 5-25 amino acids (e.g., about 15 amino acids, typically about 10-20 amino acids) that has at least about 50% or at least about 60%, preferably at least about 70%, 80%, or 85% (e.g., at least about 65 to about 95%), and more preferably at least about 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% or more amino acid sequence identity to an amino acid sequence comprising the C-terminal most 15 amino acids of the prM protein selected from the group of DEN-1 prM, DEN-2 prM, DEN-3 prM, and DEN-4 prM or to an amino acid sequence comprising a methionine residue following by the C-terminal most 15 amino acids of the prM protein selected from the group of DEN-1 prM, DEN-2 prM, DEN-3 prM, and DEN-4 prM (SEQ ID NOS:52-55, respectively). Such amino acid sequences that comprise a methionine residue followed by 15 amino acid residues are typically termed a “PRM15” sequence. In another aspect, the signal peptide comprises a signal sequence that comprises an amino acid sequence of similar size that exhibits at least about 55% amino acid sequence identity (preferably at least about 65%, more preferably at least about 75%, and even more preferably at least about 85%, at least about 90%, at least about 95% (e.g., about 80-99%) to at least one of SEQ ID NOS:52-64. In one aspect, the signal peptide comprises a signal sequence selected from any of SEQ ID NOS:52-64. The positioning of the signal sequence within the recombinant truncated E polypeptide or full length E polypeptide depends on the type of signal sequence used. The functionality of signal sequences is often position dependent, with respect to the remainder of the protein. Preferably, the signal peptide sequence is positioned N-terminal to the immunogenic E polypeptide sequence, particularly where the signal sequence is a flaviviral prM sequence or homolog thereof. The signal peptide sequence can be incorporated in any suitable portion of the polypeptide that allows the signal sequence to carry out its desired targeting function. Typically and preferably, the signal peptide sequence is positioned near to (e.g., within about 20 amino acids or less) the recombinant immunogenic amino acid sequence (e.g., the truncated E polypeptide or full length E polypeptide), or, more desirably, is directly fused to the N-terminus of the immunogenic amino acid sequence. In some instances, other heterologous domains or linkers can be positioned between the signal sequence and the immunogenic E polypeptide. Inclusion of such elements is further discussed elsewhere herein.

PRM15/Truncated E Polypeptides and PRM15/Full Length E Polypeptides

Recombinant E truncated polypeptides comprising a dengue virus prM signal peptide sequence or homolog thereof (e.g., signal peptide sequence of any of SEQ ID NOS:52-64) are typically designated signal peptide/truncated E polypeptides, or signal peptide/full length E polypeptides. In a particular format, where the signal peptides comprises the 15 C-terminal amino acid residues of a flavivirus (e.g., dengue virus) prM protein, a recombinant polypeptide comprising such signal peptide and either a recombinant truncated E polypeptide or full length E polypeptide of the invention is termed, respectively, a PRM15/truncated E protein polypeptide (also termed “PRM15/truncated E” polypeptide or “PRM15/tE” polypeptide) or a PRM15/full length E protein polypeptide (also termed “PRM15/full length E” polypeptide or “PRM15/full E” polypeptide). The PRM15 sequence is typically fused to the first amino acid of the truncated E or full length E polypeptide. In addition, PRM15/tE and PRM15/full length E polypeptide sequences typically include a methionine residue as the first amino acid of the sequence.

Polypeptides of the invention comprising a recombinant truncated E polypeptide or full length E polypeptide of the invention and a signal sequence desirably comprise an amino acid sequence that has at least about 65%, at least about 70%, preferably at least about 80%, preferably at least about 85%, 87%, or 89%, more preferably at least about 90%, 91%, 92%, 93%, or 94%, and even more preferably at least about 95% amino acid sequence identity to at least one of SEQ ID NOS:65-116 (such identity taking into account both the prM fragment or portion, such as, e.g., PRM15 fragment, and the remaining, immunogenic amino acid sequence of the truncated E polypeptide). In one embodiment, such a polypeptide comprises an amino acid that has at least about 75% (e.g., about 80% to 100%), desirably at least about 85%, favorably at least about 90%, and more desirably at least about 95% amino acid sequence identity with at least one of SEQ ID NOS:66, 67, 69, 89, 93, and 108-110. Preferably, the polypeptide comprises an amino acid sequence selected from the group of SEQ ID NOS:65-116 or an amino acid sequence that is at least about 95%, 98%, 99%, or more identical with of SEQ ID NOS:65-116. Even more preferably, the polypeptide comprises an amino acid sequence selected from (or having at least about 98%, about 99%, or more identity with) SEQ ID NOS:66, 67, 69, 89, 93, and 108-110.

In another aspect, the invention provides a recombinant or synthetic polypeptide comprising an amino acid sequence that has at least about 75%, 80%, 85%, 86%, 87%, 88% or 89%, preferably at least about 90%, 91%, 92%, 93%, or 94%, and more preferably at least about 95% (e.g., about 87-95%), 96% 97%, 98%, 99%, 99.5%, or more amino acid sequence identity to the amino acid sequence of at least one of SEQ ID NOS:65-116, wherein recombinant or synthetic polypeptide induces an immune response in a subject against at least one dengue virus of at least one serotype selected from the group of dengue-1, dengue-2, dengue-3, and dengue-4.

Some such PRM15/truncated E or PRM15/full length E polypeptides induce an immune response in a subject, or population of cells of the subject, against at least one dengue virus of at least one serotype selected from the group of dengue-1, dengue-2, dengue-3, and dengue-4 that is about equal to or greater than an immune response induced against the at least one dengue virus of the at least one serotype by a wild-type PRM15/truncated E or wild-type PRM15/full E polypeptide, respectively, of the at least one serotype. In one aspect, each wild-type PRM15/truncated E protein polypeptide is selected from SEQ ID NOS:149-152. Some such PRM15/truncated E or PRM15/full E polypeptides induce an immune response in a subject or its cells against at least one dengue virus of each of at least two, at least three, or preferably at least four serotypes that is about equal to or greater than an immune response induced in the subjects or cells thereof against the at least one dengue virus of each of the at least two, at least three, or at least four serotypes, respectively, by a wild-type PRM15/truncated E polypeptide or wild-type PRM15/full E polypeptide of each of the at least two, at least three, or at least four serotypes, respectively. Some such PRM15/tE polypeptides comprise an amino acid sequence that has at least about 90% amino acid sequence identity to the sequence of at least one of SEQ ID NOS:66, 67, 69, 89, 93, and 108-110. In one particular aspect, the polypeptide comprises an amino acid sequence selected from the group of SEQ ID NOS:66, 67, 69, 89, 93, and 108-110.

Some such PRM15/tE and full E polypeptides induces production of one or more antibodies that bind to at least one dengue virus of at least 1, at least 2, preferably at least 3, and more preferably at least 4 dengue virus serotypes (e.g., dengue-1, dengue-2, dengue-3, and/or dengue-4). In one aspect, the number of antibodies produced by a PRM15/truncated E polypeptide that bind to at least one dengue virus of at least one serotype equal to or greater than the number of antibodies induced by a WT PRM15/truncated E protein (also referred to as “PRM15/truncated E polypeptide”) of the at least one serotype, wherein each WT PRM15/truncated E protein polypeptide is selected from SEQ ID NOS:149-152.

Some such PRM15/tE polypeptides, which typically comprise an amino acid sequence that has at least about 90% sequence identity to the sequence of at least one of SEQ ID NOS:65-116, induce production of a number of antibodies that bind to at least one dengue virus of each of the at least two or three serotypes that is about equal to or greater than is induced by a wild-type PRM15/truncated E fusion protein of each of the at least two or three serotypes, wherein each wild-type PRM15/truncated E fusion protein of the selected serotype is selected from SEQ ID NOS:149-152. For some such polypeptides, the induced number of antibodies that bind to at least one dengue virus of each of the at least four serotypes is about equal to or greater than is induced by any of SEQ ID NOS:149-152.

Furthermore, some such PRM15/tE polypeptide induces production of one or more antibodies that bind more specifically to at least one dengue virus of at least one serotype than is induced by a wild-type PRM15/truncated E polypeptide of the at least one dengue virus of the at least one serotype, wherein the particular wild-type PRM15/truncated E fusion protein is selected from SEQ ID NOS:149-152.

In another aspect, recombinant PRM15/tE polypeptides and PRM15/full length E polypeptides of invention induce or produce a titer of neutralizing antibodies against at least one dengue virus of at least one dengue virus serotype, preferably at least one dengue virus of each of at least two serotypes, more preferably at least one dengue virus of each of at least three serotypes, and even more preferably at least one dengue virus of each of at least four serotypes selected from dengue-1, dengue-2, dengue-3, and dengue-4.

In one aspect of the invention, the titer of neutralizing antibodies that is about equal to or greater than a titer of neutralizing antibodies produced by the PRM15/tE or PRM15/tE polypeptide against at least one dengue virus of at least one serotype at least is equal or greater than that produced by a wild-type PRM15/truncated E polypeptide or wild-type PRM15/full E polypeptide of the at least one dengue virus of the at least one serotype.

In one particular aspect, for some such PRM15/tE polypeptides, the titer of antibodies induced against at least one dengue virus of each of at least two, three, or four dengue virus serotypes is equal to or greater than against at least one dengue virus of at least one serotype at least is equal or greater than that induced by a wild-type PRM15/truncated E polypeptide of the same dengue virus of each of the at least two, three, or four serotypes, wherein the particular wild-type PRM15/truncated E polypeptide of the same serotype (for comparison) fusion protein is selected from SEQ ID NOS:149-152.

Some such PRM15/tE polypeptides induce at least one neutralizing antibody response in a mammal to or against at least one dengue virus of each of at least two, three or four serotypes without an occurrence of antibody-dependent enhancement (ADE) upon contact of the mammal with the at least dengue virus of each of the at least two, three, or four serotypes, respectively.

In one aspect of the invention, a recombinant polypeptide that induces a neutralizing antibody response against all four dengue virus serotypes comprises an amino acid sequence having at least about 90% sequence identity to the amino acid sequence of at least one of SEQ ID NOS:66, 67, 69, 89, 93, and 108-110.

In another aspect, the invention further provides a recombinant PRM15/truncated E polypeptide that comprises an immunogenic amino acid sequence of the sequence pattern Met Xaa₁ Xaa₂ Xaa₃ Phe Ile Leu Xaa₄ Met Leu Val Xaa₅ Pro Ser Xaa₆ Xaa₇ Met Arg Cys Xaa₈ Gly Xaa₉ Xaa₁₀ Asn Xaa₁₁ Asp Phe Val Glu Gly Xaa₁₂ Ser Gly Xaa₁₃ Xaa₁₄ Trp Val Asp Xaa₁₅ Val Leu Glu His Gly Xaa₁₆ Cys Val Thr Thr Met Ala Xaa₁₇ Xaa₁₈ Lys Pro Thr Leu Asp Xaa₁₉ Glu Leu Xaa₂₀ Lys Thr Xaa₂₁ Xaa₂₂ Xaa₂₃ Xaa₂₄ Xaa₂₅ Ala Xaa₂₆ Leu Arg Xaa₂₇ Xaa₂₈ Cys Ile Glu Ala Xaa₂₉ Xaa₃₀ Xaa₃₁ Asn Xaa₃₂ Thr Thr Xaa₃₃ Xaa₃₄ Arg Cys Pro Thr Gln Gly Glu Xaa₃₅ Xaa₃₆ Xaa₃₇ Xaa₃₈ Glu Glu Gln Asp Xaa₃₉ Xaa₄₀ Xaa₄₁ Xaa₄₂ Cys Xaa₄₃ Xaa₄₄ Xaa₄₅ Xaa₄₆ Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Xaa₄₇ Xaa₄₈ Xaa₄₉ Thr Cys Ala Xaa₅₀ Phe Xaa₅₁ Cys Xaa₅₂ Xaa₅₃ Xaa₅₄ Xaa₅₅ Glu Gly Xaa₅₆ Xaa₅₇ Val Gln Xaa₅₈ Glu Asn Leu Xaa₅₉ Tyr Xaa₆₀ Xaa₆₁ Xaa₆₂ Xaa₆₃ Thr Xaa₆₄ His Xaa₆₅ Gly Xaa₆₆ Xaa₆₇ His Xaa₆₈ Val Gly Asn Xaa₆₉ Thr Xaa₇₀ Xaa₇₁ Xaa₇₂ Gly Xaa₇₃ Xaa₇₄ Xaa₇₅ Xaa₇₆ Ile Thr Pro Gln Xaa₇₇ Xaa₇₈ Xaa₇₉ Xaa₈₀ Glu Xaa₈₁ Xaa₈₂ Leu Xaa₈₃ Xaa₈₄ Tyr Gly Xaa₈₅ Xaa₈₆ Xaa₈₇ Xaa₈₈ Xaa₈₉ Cys Ser Pro Arg Thr Gly Leu Asp Phe Asn Xaa₉₀ Xaa₉₁ Xaa₉₂ Leu Xaa₉₃ Xaa₉₄ Met Lys Xaa₉₅ Lys Xaa₉₆ Trp Xaa₉₇ Val His Xaa₉₈ Gln Trp Xaa₉₉ Xaa₁₀₀ Asp Leu Pro Leu Pro Trp Thr Xaa₁₀₁ Gly Ala Xaa₁₀₂ Thr Xaa₁₀₃ Xaa₁₀₄ Xaa₁₀₅ Xaa₁₀₆ Trp Asn Xaa₁₀₇ Lys Glu Xaa₁₀₈ Xaa₁₀₉ Val Thr Phe Lys Xaa₁₁₀ Xaa₁₁₁ His Ala Lys Xaa₁₁₂ Gln Xaa₁₁₃ Val Xaa₁₁₄ Val Leu Gly Ser Gln Glu Gly Xaa₁₁₅ Met His Xaa₁₁₆ Ala Leu Xaa₁₁₇ Gly Xaa₁₁₈ Thr Glu Xaa₁₁₉ Xaa₁₂₀ Xaa₁₂₁ Xaa₁₂₂ Xaa₁₂₃ Gly Xaa₁₂₄ Thr Xaa₁₂₅ Xaa₁₂₆ Phe Xaa₁₂₇ Gly Xaa₁₂₈ Leu Lys Cys Xaa₁₂₉ Xaa₁₃₀ Xaa₁₃₁ Met Xaa₁₃₂ Lys Leu Xaa₁₃₃ Xaa₁₃₄ Lys Gly Xaa₁₃₅ Ser Tyr Xaa₁₃₆ Met Cys Thr Gly Xaa₁₃₇ Phe Xaa₁₃₈ Xaa₁₃₉ Xaa₁₄₀ Lys Glu Xaa₁₄₁ Ala Glu Thr Gln His Gly Thr Xaa₁₄₂ Xaa₁₄₃ Xaa₁₄₄ Xaa₁₄₅ Val Xaa₁₄₆ Tyr Xaa₁₄₇ Gly Xaa₁₄₈ Xaa₁₄₉ Xaa₁₅₀ Pro Cys Lys Ile Pro Xaa₁₅₁ Xaa₁₅₂ Xaa₁₅₃ Xaa₁₅₄ Asp Xaa₁₅₅ Xaa₁₅₆ Xaa₁₅₇ Xaa₁₅₈ Xaa₁₅₉ Xaa₁₆₀ Xaa₁₆₁ Gly Arg Leu Ile Thr Xaa₁₆₂ Asn Pro Xaa₁₆₃ Val Xaa₁₆₄ Xaa₁₆₅ Lys Xaa₁₆₆ Xaa₁₆₇ Pro Val Asn Ile Glu Xaa₁₆₈ Glu Pro Pro Phe Gly Xaa₁₆₉ Ser Xaa₁₇₀ Ile Xaa₁₇₁ Xaa₁₇₂ Gly Xaa₁₇₃ Xaa₁₇₄ Xaa₁₇₅ Xaa₁₇₆ Xaa₁₇₇ Leu Xaa₁₇₈ Xaa₁₇₉ Xaa₁₈₀ Trp Xaa₁₈₁ Xaa₁₈₂ Lys Gly Ser Ser Ile Gly Xaa₁₈₃ Met Phe Glu Xaa₁₈₄ Thr Xaa₁₈₅ Arg Gly Ala Xaa₁₈₆ Arg Met Ala Ile Leu Gly Xaa₁₈₇ Thr Ala Trp Asp Xaa₁₈₈ Gly Ser Xaa₁₈₉ Xaa₁₉₀ Xaa₁₉₁ Xaa₁₉₂ Xaa₁₉₃ Xaa₁₉₄ Xaa₁₉₅ Xaa₁₉₆ Xaa₁₉₇ Xaa₁₉₈ Xaa₁₉₉ Xaa₂₀₀ Xaa₂₀₁ Xaa₂₀₂ Xaa₂₀₃ Xaa₂₀₄ Xaa₂₀₅ Xaa₂₀₆ Xaa₂₀₇ Xaa₂₀₈ Xaa₂₀₉ Xaa₂₁₀ Xaa₂₁₁ Xaa₂₁₂ Xaa₂₁₃ (SEQ ID NO:51) wherein Xaa at a particular position represents either any or no amino acid residue at that position (typically and preferably, less than twenty of the variable positions are single residue deletions—i.e., represent no amino acid at the indicated position). Preferably, the polypeptide in this aspect typically is characterized by the presence of naturally occurring amino acids, and preferably amino acids that have a hydropathy score above 0. Preferred amino acids for each of the variable positions (as designated by the subscripted numbers in the sequence pattern) are set forth in Table 3.

TABLE 3 X₁: A V T G X₂: V I X₃: I F X₄: L M X₅: A T X₆: Y M X₇: A T G X₈: V I X₉: V I T X₁₀: S G X₁₁: R G X₁₂: L V X₁₃: A G X₁₄: A T S X₁₅: L V X₁₆: S G X₁₇: K R Q X₁₈: G N X₁₉: I F X₂₀: L I Q X₂₁: T I E X₂₂: A V X₂₃: K T X₂₄: Q E N X₂₅: L V P X₂₆: L V T X₂₇: K T X₂₈: L Y X₂₉: K S X₃₀: L I X₃₁: T S X₃₂: I T X₃₃: A E D X₃₄: T S X₃₅: A P X₃₆: I T Y N X₃₇: L M X₃₈: K V P X₃₉: T Q X₄₀: Q N X₄₁: F Y X₄₂: V I X₄₃: K R X₄₄: H R X₄₅: T S D X₄₆: V F Y M X₄₇: S G X₄₈: L V I X₄₉: V I X₅₀: K M X₅₁: K T Q X₅₂: L V K X₅₃: K T E X₅₄: K P N X₅₅: L I M X₅₆: K N X₅₇: V I X₅₈: H P Y X₅₉: K E X₆₀: T S X₆₁: V I X₆₂: V I X₆₃: V I X₆₄: V P X₆₅: T S X₆₆: E D X₆₇: Q E X₆₈: A Q X₆₉: E D X₇₀: T S G — X₇₁: K E N — X₇₂: H Q X₇₃: V K T X₇₄: T I E X₇₅: A V I X₇₆: K T E X₇₇: A S X₇₈: S P X₇₉: T I X₈₀: V T S X₈₁: A I X₈₂: I Q E X₈₃: T P X₈₄: G E D X₈₅: A T X₈₆: L V X₈₇: T G X₈₈: L M X₈₉: E D X₉₀: R E X₉₁: V M X₉₂: V I X₉₃: L M X₉₄: K T X₉₅: K S N X₉₆: A T S X₉₇: L M X₉₈: K R G X₉₉: L F X₁₀₀: L F X₁₀₁: A S X₁₀₂: T S D X₁₀₃: S E X₁₀₄: V T Q E X₁₀₅: V H E P X₁₀₆: T H — X₁₀₇: H R X₁₀₈: L R X₁₀₉: L M X₁₁₀: V T N X₁₁₁: A P X₁₁₂: K R X₁₁₃: E D X₁₁₄: V T X₁₁₅: A T X₁₁₆: T S X₁₁₇: A T X₁₁₈: A T X₁₁₉: V I X₁₂₀: Q D X₁₂₁: T S N M X₁₂₂: S G X₁₂₃: S D X₁₂₄: T N X₁₂₅: L T H X₁₂₆: L I M X₁₂₇: A T X₁₂₈: H R X₁₂₉: K R X₁₃₀: L V X₁₃₁: K R X₁₃₂: E D X₁₃₃: T R Q X₁₃₄: L I X₁₃₅: V M X₁₃₆: V T S X₁₃₇: K S X₁₃₈: K Q X₁₃₉: L I X₁₄₀: V E X₁₄₁: V I X₁₄₂: V I X₁₄₃: L V X₁₄₄: V I X₁₄₅: K R Q X₁₄₆: K Q E X₁₄₇: K E X₁₄₈: T E D X₁₄₉: G D X₁₅₀: A S X₁₅₁: L V F X₁₅₂: S E X₁₅₃: I T S X₁₅₄: Q E M X₁₅₅: L G E X₁₅₆: K Q E X₁₅₇: K G X₁₅₈: K V R X₁₅₉: A T H X₁₆₀: V H Q X₁₆₁: L N X₁₆₂: A V X₁₆₃: A V I X₁₆₄: I T X₁₆₅: K E D X₁₆₆: E D X₁₆₇: K S E X₁₆₈: L A X₁₆₉: E D X₁₇₀: Y N X₁₇₁: V I X₁₇₂: V I X₁₇₃: A V I X₁₇₄: G E X₁₇₅: E P D X₁₇₆: K S G X₁₇₇: A Q X₁₇₈: K T X₁₇₉: L I X₁₈₀: S H N X₁₈₁: F Y X₁₈₂: K R X₁₈₃: K Q X₁₈₄: A T S X₁₈₅: A Y M X₁₈₆: K R X₁₈₇: E D X₁₈₈: L F X₁₈₉: A L V I X₁₉₀: G Y X₁₉₁: T G X₁₉₂: L V — X₁₉₃: L F — X₁₉₄: T N — X₁₉₅: S — X₁₉₆: L V I — X₁₉₇: G — X₁₉₈: K — X₁₉₉: A — M X₂₀₀: L V — X₂₀₁: H — X₂₀₂: Q — X₂₀₃: V I — X₂₀₄: F — X₂₀₅: G — X₂₀₆: A S — X₂₀₇: V I — X₂₀₈: F Y — X₂₀₉: T G — X₂₁₀: A T S — X₂₁₁: V —M X₂₁₂: G F — X₂₁₃: K G —

As used in Table 3, a dash (-) represents that a single residue deletion can be preferred at the indicated position in the sequence pattern (i.e., the particular position in the sequence pattern can lack any amino acid residue). Desirably, the polypeptide has an amino acid sequence wherein each of the above-identified variable positions is filled by one of the preferred residues listed in Table 2 (or a single residue deletion, if applicable).

In a particularly preferred aspect, the invention provides a recombinant PRM15/truncated E polypeptide comprising an immunogenic amino acid sequence of the sequence pattern Met Xaa₁ Xaa₂ Xaa₃ Phe Ile Leu Xaa₄ Met Leu Val Xaa₅ Pro Ser Xaa₆ Xaa₇ Met Arg Cys Val Gly Xaa₈ Gly Asn Arg Asp Phe Val Glu Gly Xaa₉ Ser Gly Xaa₁₀ Xaa₁₁ Trp Val Asp Xaa₁₂ Val Leu Glu His Gly Xaa₁₃ Cys Val Thr Thr Met Ala Lys Asn Lys Pro Thr Leu Asp Xaa₁₄ Glu Leu Xaa₁₅ Lys Thr Xaa₁₆ Xaa₁₇ Xaa₁₈ Xaa₁₉ Xaa₂₀ Ala Xaa₂₁ Leu Arg Xaa₂₂ Xaa₂₃ Cys Ile Glu Ala Xaa₂₄ Ile Xaa₂₅ Asn Xaa₂₆ Thr Thr Xaa₂₇ Xaa₂₈ Arg Cys Pro Thr Gln Gly Glu Xaa₂₉ Xaa₃₀ Leu Xaa₃₁ Glu Glu Gln Asp Xaa₃₂ Xaa₃₃ Xaa₃₄ Xaa₃₅ Cys Xaa₃₆ Xaa₃₇ Xaa₃₈ Xaa₃₉ Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser Xaa₄₀ Xaa₄₁ Thr Cys Ala Lys Phe Xaa₄₂ Cys Xaa₄₃ Xaa₄₄ Xaa₄₅ Xaa₄₆ Glu Gly Xaa₄₇ Xaa₄₈ Val Gln Xaa₄₉ Glu Asn Leu Xaa₅₀ Tyr Thr Xaa₅₁ Xaa₅₂ Ile Thr Xaa₅₃ His Xaa₅₄ Gly Xaa₅₅ Xaa₅₆ His Xaa₅₇ Val Gly Asn Asp Thr Xaa₅₈ Xaa₅₉ Xaa₆₀ Gly Xaa₆₁ Xaa₆₂ Xaa₆₃ Xaa₆₄ Ile Thr Pro Gln Xaa₆₅ Ser Xaa₆₆ Xaa₆₇ Glu Ala Xaa₆₈ Leu Xaa₆₉ Xaa₇₀ Tyr Gly Thr Xaa₇₁ Xaa₇₂ Xaa₇₃ Glu Cys Ser Pro Arg Thr Gly Leu Asp Phe Asn Xaa₇₄ Xaa₇₅ Xaa₇₆ Leu Leu Thr Met Lys Xaa₇₇ Lys Xaa₇₈ Trp Xaa₇₉ Val His Xaa₈₀ Gln Trp Phe Xaa₈₁ Asp Leu Pro Leu Pro Trp Thr Xaa₈₂ Gly Ala Xaa₈₃ Thr Xaa₈₄ Xaa₈₅ Xaa₈₆ Xaa₈₇ Trp Asn Xaa₈₈ Lys Glu Xaa₈₉ Xaa₉₀ Val Thr Phe Lys Xaa₉₁ Xaa₉₂ His Ala Lys Xaa₉₃ Gln Xaa₉₄ Val Xaa₉₅ Val Leu Gly Ser Gln Glu Gly Ala Met His Xaa₉₆ Ala Leu Xaa₉₇ Gly Ala Thr Glu Xaa₉₈ Xaa₉₉ Xaa₁₀₀ Xaa₁₀₁ Xaa₁₀₂ Gly Xaa₁₀₃ Xaa₁₀₄ Xaa₁₀₅ Xaa₁₀₆ Phe Xaa₁₀₇ Gly His Leu Lys Cys Xaa₁₀₈ Xaa₁₀₉ Xaa₁₁₀ Met Asp Lys Leu Xaa₁₁₁ Leu Lys Gly Xaa₁₁₂ Ser Tyr Xaa₁₁₃ Met Cys Thr Gly Xaa₁₁₄ Phe Xaa₁₁₅ Xaa₁₁₆ Xaa₁₁₇ Lys Glu Xaa₁₁₈ Ala Glu Thr Gln His Gly Thr Xaa₁₁₉ Xaa₁₂₀ Xaa₁₂₁ Xaa₁₂₂ Val Xaa₁₂₃ Tyr Xaa₁₂₄ Gly Xaa₁₂₅ Xaa₁₂₆ Xaa₁₂₇ Pro Cys Lys Ile Pro Xaa₁₂₈ Xaa₁₂₉ Xaa₁₃₀ Xaa₁₃₁ Asp Xaa₁₃₂ Xaa₁₃₃ Xaa₁₃₄ Xaa₁₃₅ Xaa₁₃₆ Xaa₁₃₇ Xaa₁₃₈ Gly Arg Leu Ile Thr Xaa₁₃₉ Asn Pro Xaa₁₄₀ Val Xaa₁₄₁ Xaa₁₄₂ Lys Xaa₁₄₃ Xaa₁₄₄ Pro Val Asn Ile Glu Xaa₁₄₅ Glu Pro Pro Phe Gly Xaa₁₄₆ Ser Xaa₁₄₇ Ile Xaa₁₄₈ Xaa₁₄₉ Gly Xaa₁₅₀ Xaa₁₅₁ Xaa₁₅₂ Xaa₁₅₃ Xaa₁₅₄ Leu Xaa₁₅₅ Xaa₁₅₆ Xaa₁₅₇ Trp Xaa₁₅₈ Xaa₁₅₉ Lys Gly Ser Ser Ile Gly Xaa₁₆₀ Met Phe Glu Xaa₁₆₁ Thr Xaa₁₆₂ Arg Gly Ala Xaa₁₆₃ Arg Met Ala Ile Leu Gly Xaa₁₆₄ Thr Ala Trp Asp Phe Gly Ser Xaa₁₆₅ Gly Gly Xaa₁₆₆ Xaa₁₆₇ Thr Ser Xaa₁₆₈ Gly Lys Xaa₁₆₉ Xaa₁₇₀ His Gln Xaa₁₇₁ Phe Gly Xaa₁₇₂ Xaa₁₇₃ Tyr Xaa₁₇₄ Xaa₁₇₅ Xaa₁₇₆ Xaa₁₇₇ Xaa₁₇₈ (SEQ ID NO:50), wherein Xaa represents any amino acid residue (usually a naturally occurring amino acid residue, and more preferably an amino acid residue with a hydropathy score of above 0) or a single residue deletion, as described above (typically, less than twenty of the variable positions are single residue deletions—i.e., represent no amino acid at the indicated position).

Preferred amino acids for the each variable positions (as designated by the subscripted numbers in the sequence pattern) are set forth in Table 4.

TABLE 4 X₁: V T G X₂: V I X₃: I F X₄: L M X₅: A T X₆: Y M X₇: A T G X₈: V I X₉: L V X₁₀: A G X₁₁: A T X₁₂: L V X₁₃: S G X₁₄: I F X₁₅: L I X₁₆: T E X₁₇: A V X₁₈: K T X₁₉: E N X₂₀: V P X₂₁: L V T X₂₂: K T X₂₃: L Y X₂₄: K S X₂₅: T S X₂₆: I T X₂₇: A D X₂₈: T S X₂₉: A P X₃₀: I T Y X₃₁: K V P X₃₂: T Q X₃₃: Q N X₃₄: F Y X₃₅: V I X₃₆: K R X₃₇: H R X₃₈: T D X₃₉: V F Y X₄₀: L V X₄₁: V I X₄₂: K T Q X₄₃: L V K X₄₄: K T E X₄₅: K P N X₄₆: L I M X₄₇: K N X₄₈: V I X₄₉: H P X₅₀: K E X₅₁: V I X₅₂: V I X₅₃: V P X₅₄: T S X₅₅: E D X₅₆: Q E X₅₇: A Q X₅₈: S G — X₅₉: K N — X₆₀: H Q X₆₁: V K X₆₂: T E X₆₃: V I X₆₄: K E X₆₅: A S X₆₆: T I X₆₇: V T X₆₈: I E X₆₉: T P X₇₀: G E X₇₁: L V X₇₂: T G X₇₃: L M X₇₄: R E X₇₅: V M X₇₆: V I X₇₇: K N X₇₈: A T S X₇₉: L M X₈₀: K R X₈₁: L F X₈₂: A S X₈₃: T D X₈₄: S E X₈₅: T E X₈₆: V P X₈₇: T H X₈₈: H R X₈₉: L R X₉₀: L M X₉₁: V T N X₉₂: A P X₉₃: K R X₉₄: E D X₉₅: V T X₉₆: T S X₉₇: A T X₉₈: V I X₉₉: Q D X₁₀₀: S M X₁₀₁: S G X₁₀₂: S D X₁₀₃: T N X₁₀₄: T — X₁₀₅: L T X₁₀₆: L I X₁₀₇: A T X₁₀₈: K R X₁₀₉: L V X₁₁₀: K R X₁₁₁: T Q X₁₁₂: V M X₁₁₃: V T S X₁₁₄: K S X₁₁₅: K Q X₁₁₆: L I X₁₁₇: V E X₁₁₈: V I X₁₁₉: V I X₁₂₀: L V X₁₂₁: V I X₁₂₂: K R Q X₁₂₃: K Q E X₁₂₄: K E X₁₂₅: T E D X₁₂₆: G D X₁₂₇: A S X₁₂₈: L F X₁₂₉: S E X₁₃₀: T I S X₁₃₁: Q E M X₁₃₂: L G E X₁₃₃: K Q E X₁₃₄: K G X₁₃₅: V K R X₁₃₆: A T H X₁₃₇: V H Q X₁₃₈: L N X₁₃₉: A V X₁₄₀: V I X₁₄₁: T I X₁₄₂: K E D X₁₄₃: E D X₁₄₄: K S E X₁₄₅: A L X₁₄₆: E D X₁₄₇: Y N X₁₄₈: V I X₁₄₉: V I X₁₅₀: A V I X₁₅₁: G E X₁₅₂: E P D X₁₅₃: K S G X₁₅₄: A Q X₁₅₅: K T X₁₅₆: L I X₁₅₇: S H N X₁₅₈: F Y X₁₅₉: K R X₁₆₀: K Q X₁₆₁: A T S X₁₆₂: A Y M X₁₆₃: K R X₁₆₄: E D X₁₆₅: L V X₁₆₆: L V X₁₆₇: L F X₁₆₈: L I X₁₆₉: A M X₁₇₀: L V X₁₇₁: V I X₁₇₂: A S X₁₇₃: V I X₁₇₄: T G X₁₇₅: A T X₁₇₆: —M X₁₇₇: F — X₁₇₈: G —

Preferred amino acids for the each variable positions (as designated by the subscripted numbers in the sequence pattern) are set forth in Table 4. Desirably, such polypeptide has an immunogenic amino acid sequence wherein each of the above-identified variable positions is filled by one of the residues listed in Table 3 (or a single residue deletion, if applicable).

The invention also provides a recombinant truncated E polypeptide that induces an immune response against at least one dengue virus of each of at least two serotypes that is about equal to or greater than the immune response induced against the at least one dengue of each of the at least two serotypes by a combination of wild-type PRM15/truncated E polypeptides of the at least two serotypes, wherein each said wild-type PRM15/truncated E polypeptide is selected from SEQ ID NOS:149-152.

C15/Full Length prM/Full Length E Polypeptides

The invention also provides a recombinant or synthetic polypeptide comprising an amino acid sequence that has at least about 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more amino acid sequence identity to an amino acid sequence of at least one of SEQ ID NOS:139-148, 236-253, 343, and 345. In another aspect, a recombinant or synthetic polypeptide comprising an amino acid sequence that has at least about 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more amino acid sequence identity to an amino acid sequence of at least one of SEQ ID NOS:139-145, 147-148, 236-253, 343, and 345 is provided. Such polypeptides of the invention are typically termed C15/full length prM/full length E protein polypeptides (or simply “C15/full prM/full E” polypeptides”).

Such C15/full length prM/full length E polypeptides induce an immune response in a subject, e.g., mammal against at least one dengue virus of at least one serotype selected from the group of dengue-1, dengue-2, dengue-3, and dengue-4. Further, some such polypeptides induce an immune response in a subject against at least one dengue virus of each of at least two, preferably at least three, and more preferably at least four serotypes selected from the group of dengue-1, dengue-2, dengue-3, and dengue-4. Preferably, the immune response induced against at least one dengue virus of the at least one serotypes is about equal to or greater than an immune response induced against the at least one dengue virus of the at least one serotype by a corresponding WT C15/full length prM/full length E fusion protein of the at least one serotype, wherein the WT C15/full length prM/full length E fusion protein of the same serotype for comparison is selected from SEQ ID NOS:227-230.

In one particular aspect, the immune response induced by the recombinant C15/full prM/full E polypeptide induced against at least one dengue virus of each of the four serotypes is about equal to or greater than that induced against that at least one dengue virus by any sequence selected from the group of SEQ ID NOS:227-230.

Such C15/full prM/full E polypeptides induce production of one or more antibodies that bind to at least one dengue virus of at least one serotype. Preferably, such polypeptides induces production of one or more antibodies that bind to at least one dengue virus of each of at least two, at least three, or preferably at least four dengue virus serotypes.

Such polypeptides induce the production of a number of antibodies that bind to at least one dengue virus of at least one serotype that is about equal to or greater than the number of antibodies that bind to the at least one dengue virus of the at least one serotype induced by a WT C15/full length prM/full length E fusion protein of the at least one dengue virus of the at least one serotype, wherein the WT C15/full length prM/full length E fusion protein of each particular serotype is selected for comparison from SEQ ID NOS:227-230.

In one aspect, a recombinant C15/full length prM/full length E polypeptide induces production of a number of antibodies that bind to at least one dengue virus of each of the at least two or at least three serotypes, wherein the number is about equal to or greater than the number of antibodies that bind to the at least one dengue virus of each of the at least two or at least three serotypes that are induced by a wild-type C15/full length prM/full length E polypeptide corresponding to each of the at least two or three serotypes selected from SEQ ID NOS:227-230, respectively. In one particular embodiment, a C15/full prM/full E polypeptide induces production of a number of antibodies that bind to at least one dengue virus of each of dengue-1, dengue-2, dengue-3, and dengue-4 that is about equal to or greater the number of antibodies that bind to the at least one dengue virus of each of those four serotypes that are induced by any one SEQ ID NOS:227-230, respectively.

Such recombinant wild-type C15/full length prM/full length E polypeptides induce the production of antibodies that bind more specifically to at least one particular dengue virus of at least one particular serotype than is induced by a corresponding wild-type C15/full length prM/full length E polypeptide from the dengue virus of the same one serotype, wherein the wild-type C15/full length prM/full length E polypeptide is selected from SEQ ID NOS:227-230.

Another characteristic of a recombinant C15/full length prM/full length E polypeptide of the invention is the ability to induce the production of a titer of neutralizing antibodies against at the least one dengue virus of the at least one serotype. In one aspect, the polypeptide produces a titer of neutralizing antibodies against at least one dengue virus of each of at least two, at least three, or at least four serotypes. Some such polypeptides induce at least one neutralizing antibody response in a subject to or against at least one dengue virus of each of at least two, three or four serotypes without an occurrence of antibody-dependent enhancement (ADE) upon contact of the subject with the at least dengue virus of each of the at least two, three, or four serotypes, respectively.

Some such polypeptides produce a titer of neutralizing antibodies in a subject against at least one dengue virus of at least one serotype that is about equal to or greater than a titer of neutralizing antibodies produced in the subject against the at least one dengue virus of the at least one serotype by a WT C15/full length prM/full length E fusion protein of the at least one dengue virus of the at least one serotype. The WT C15/full length prM/full length E fusion protein can be selected from SEQ ID NOS:227-230. Further, some polypeptides produce a titer of neutralizing antibodies against at least one dengue virus of each of at least two, three, or four serotypes that is about equal to or greater than a titer of neutralizing antibodies produced against the at least one dengue virus of each of at least two, three, or four serotypes by a WT C15/full length prM/full length E fusion protein of the at least one dengue virus of each of the at least two, three, or four serotypes, respectively, wherein each WT C15/full length prM/full length E polypeptide is selected from SEQ ID NOS:227-230.

In one aspect, polypeptides having these immunogenic and immune-stimulating properties comprise an amino acid sequence that has at least about 95% amino acid sequence identity to the amino acid sequence of SEQ ID NO:140 or SEQ ID NO:141. In one particular aspect, such a polypeptide comprises SEQ ID NO:141.

Some such C15/full length prM/full length E polypeptides of the invention induce an immune response against at least one dengue virus of each of at least two serotypes that is about equal to or greater than the immune response induced against any of the at least one dengue virus of each of the at least two serotypes by a combination of WT C15/full length prM/full length E polypeptides of each of the at least two serotypes. WT C15/full length prM/full length E polypeptides can be selected from the group of SEQ ID NOS:227-230.

Recombinant C15/full length prM/full length E polypeptides exhibit additional biological properties that can be favorable for inducing, promoting, modulating, and/or enhancing an immune response, such as, e.g., the ability to form immunogenic viruses or virus-like particles in cells of a subject, including, e.g., mammalian cells. The invention provides a population of such recombinant polypeptides, wherein such polypeptides are capable of assembling with one another (and with other polypeptides and nucleic acids, as desired) to form one or more VLPs or viruses.

The invention also provides an antigenic or immunogenic fragment of any such polypeptide of the invention. The antigenic or immunogenic fragment of the polypeptide may comprise an amino acid sequence of about 10, about 20, about 30, or about 50 amino acids that comprises at least one T cell epitope not present in corresponding wild-type dengue virus C15, prM and E protein amino acid sequences, wherein the novel epitope is derived from one of the novel immunogenic amino acid sequences disclosed herein. Extension of the polypeptide by, e.g., inclusion of additional amino acid residue(s) and/or polypeptide or peptide segment (e.g., a signal peptide sequence or C-terminal E protein sequence, N-terminal prM protein sequence, or C15/N-terminal prM protein sequence) increases the length of the polypeptide. Common and preferable sizes for such polypeptides are further discussed elsewhere herein.

C Terminal E Protein Fragment Polypeptides

In one aspect of the invention, recombinant immunogenic or antigenic truncated E polypeptides, PRM15/truncated E polypeptides, and C15/full length prM/truncated E polypeptides of the invention may further comprise an additional amino acid sequence that is similar, substantially similar, or identical to an amino sequence segment or fragment (e.g., hydrophobic amino acid sequence) of the C-terminus transmembrane domain of a wild-type. E protein of a flavivirus or a recombinant E protein of a flavivirus (such as, preferably, a dengue virus or yellow fever virus). Such amino acid sequence, which may be termed a C terminal amino acid fragment of a (flavivirus) E protein, C terminal E protein fragment (or simply “rest of envelope” or “rest of env” sequence), is typically about 20 to about 70 amino acid residues in length, more typically about 40 to about 65 amino acid residues in length, and often about 40 to about 65 amino acid residues in length. Such an amino acid fragment may comprise the amino acid sequence corresponding to the “stem-anchor region” of an E protein, which would typically anchor the remainder of the E protein (e.g., the truncated E protein portion) to the cell membrane when such E protein was incorporated into a virus.

Some such C terminal E protein fragments each comprise an amino acid sequence that has at least about 45%, desirably at least about 50% or 55% (e.g., about 60-99%), favorably at least about 60% or 65%, more favorably at least about 70% or 75%, advantageously at least about 80% or 85%, and preferably at least about 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% or more amino acid sequence identity to at least one amino acid sequence of the group of SEQ ID NOS:127-136.

Each such C terminal E protein fragment sequence is positioned near to (e.g., is within about 20 amino acids of) or fused to any of the following: 1) the C terminal amino acid of a recombinant truncated envelope (tE) protein polypeptide of the invention (e.g., SEQ ID NOS:1-49 and 154-155); 2) the C terminal amino acid of a recombinant PRM15/tE polypeptide of the invention (e.g., SEQ ID NOS:65-116); 3) the C terminal amino acid of a recombinant full length prM/tE fusion protein of the invention; or 4) the C terminal amino acid of a recombinant C15/full length prM/tE protein polypeptide of the invention as describe above. Such C terminal E protein fragment serves to extend the length of the truncated E polypeptide to a length that is about equal to or substantially equivalent to the length of a full length E polypeptide of a wildtype flavivirus, preferably a dengue virus of a particular serotype. Thus, a recombinant truncated E polypeptide, PRM15/truncated E polypeptide, and C15/full length prM/truncated E polypeptide of the invention that further comprises such a C terminal E polypeptide are typically referred to as a recombinant full length E polypeptide, PRM15/full length E polypeptide, and C15/full length prM/full length E polypeptide, respectively.

In one aspect, the amino acid sequence of such C terminal amino acid fragment of a (flavivirus) E protein comprises, or typically consists essentially of, an amino acid sequence according to the sequence pattern: Gly Val Ser Trp Xaa₁ Xaa₂ Xaa₃ Ile Xaa₄ Ile Gly Xaa₅ Xaa₆ Xaa₇ Xaa₈ Trp Xaa₉ Gly Xaa₁₀ Asn Ser Xaa₁₁ Xaa₁₂ Thr Ser Xaa₁₃ Xaa₁₄ Xaa₁₅ Xaa₁₆ Xaa₁₇ Xaa₁₈ Xaa₁₉ Xaa₂₀ Gly Xaa₂₁ Xaa₂₂ Thr Leu Xaa₂₃ Leu Gly Xaa₂₄ Xaa₂₅ Val Xaa₂₆ Ala, wherein Xaa represents any amino acid residue (see SEQ ID NO:137). Preferred amino acid residues for the variable positions in this sequence pattern are provided in Table 5.

TABLE 5 X₁: M I T X₂: V M X₃: K R X₄: L G X₅: V I F X₆: I L X₇: I L V X₈: T L X₉: I L X₁₀: M L T X₁₁: K R X₁₂: S N X₁₃: L M X₁₄: S A X₁₅: V M F X₁₆: S T X₁₇: L C X₁₈: V I X₁₉: L A X₂₀: V I X₂₁: V M I G X₂₂: V I X₂₃: Y F X₂₄: A V F X₂₅: M V T X₂₆: Q H Desirably, each one of the variable positions in this sequence pattern is filled by one of the preferred residues provided in Table 5.

The invention also includes polynucleotides encoding all such recombinant polypeptides as described herein and below. Such polypeptides and polypeptide-encoding nucleotides are useful in methods of the invention described throughout, including, e.g., but not limited to, prophylactic and/or therapeutic methods of treatment to induce, modulate, enhance, and/or promote an immune response(s) to at least one dengue virus of at least one flavivirus serotype in a mammal, and/or methods of detecting or diagnosing the presence of antibodies in a sample that bind to one or more dengue viruses of one or more serotypes.

N Terminal C15/Truncated prM Polypeptides

Recombinant PRM15/tE polypeptides and PRM15/full E polypeptides may further comprise an additional amino acid sequence that has at least about 50%, desirably at least about 60% (e.g., about 65% to about 100%), favorably at least about 70%, preferably at least about 80%, and more preferably at least about 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% or more amino acid sequence identity to an amino acid sequence of at least one of the group of SEQ ID NOS:117-126. In one embodiment, such an amino acid sequence comprises at least the two following segments: (1) the last 15 amino acid residues of recombinant or wild-type capsid (C) protein of a flavivirus (e.g., preferably dengue virus), as measured in reverse sequence order from the C terminus of the capsid protein sequence and (2) all of the amino acid residues of a recombinant or wild-type prM protein of a flavivirus (e.g., preferably dengue virus), as measured in sequence order from the N terminus of the prM protein sequence, except the last 15 amino acid of the C terminus of the prM protein. Typically, segment (1) and segment (2) are attached or fused, and segment (1) precedes segment (2), positioned at the N terminus. Such an amino acid sequence is typically termed an N terminal amino acid fragment sequence of C15/prM or an N terminal C15/truncated prM polypeptide (or “rest of C15/PRM,” since it includes 15 residues from the C protein and the remaining residues of the prM protein, but for the 15 C terminal residues). Such an N terminal C15/truncated prM polypeptide is usually at least about 150, 160, 165, 170, 175, or 180 amino acid residues in length. Some such polypeptide fragments are at least about 165 to about 175 amino acids in length. Some such polypeptide fragments are at least about 167 to about 171 amino acids in length.

An N terminal C15/truncated prM polypeptide is positioned near to or at (e.g., is within about 20 amino acids of), or is fused directly to, the N-terminus of the signal peptide sequence or immunogenic amino acid sequence (e.g., an immunogenic truncated E polypeptide sequence or an immunogenic full length E polypeptide sequence) of an immunogenic polypeptide of the invention, depending on the position (and presence) of the signal peptide sequence in the polypeptide. For example, in one format, an N terminal C15/truncated prM polypeptide is positioned near or at or is fused directly to the N terminus of a recombinant PRM15/tE dengue virus polypeptide of the invention (e.g., SEQ ID NOS:65-116) or the N terminus of a recombinant PRM15/full length E polypeptide of the invention (SEQ ID NOS:139-148, 236-253). In another aspect, the N terminal C15/truncated prM polypeptide is positioned near or at or is fused directly to the N terminus of a recombinant truncated E dengue virus polypeptide of the invention (e.g., SEQ ID NOS:1-49 and 153-155) or the N terminus of a recombinant full length E polypeptide of the invention.

In another aspect, a recombinant PRM15/tE polypeptide or PRM15/full E polypeptide further comprises an N terminal C15/truncated prM polypeptide positioned near to or at or fused directly to the N-terminus of the PRM15/tE polypeptide or PRM15/full E polypeptide, respectively, wherein the N terminal C15/truncated prM polypeptide comprises an amino acid sequence according to the sequence pattern set forth in SEQ ID NO:138, which comprises: Xaa₁ Xaa₂ Xaa₃ Xaa₄ Xaa₅ Xaa₆ Xaa₇ Xaa₈ Xaa₉ Xaa₁₀ Xaa₁₁ Pro Xaa₁₂ Xaa₁₃ Xaa₁₄ Ala Phe Xaa₁₅ Leu Xaa₁₆ Xaa₁₇ Arg Xaa₁₈ Gly Glu Pro Xaa₁₉ Xaa₂₀ Ile Val Xaa₂₁ Xaa₂₂ Xaa₂₃ Glu Xaa₂₄ Gly Xaa₂₅ Xaa₂₆ Leu Leu Phe Lys Thr Xaa₂₇ Xaa₂₈ Gly Xaa₂₉ Asn Xaa₃₀ Cys Thr Leu Xaa₃₁ Ala Xaa₃₂ Asp Leu Gly Glu Xaa₃₃ Cys Xaa₃₄ Asp Thr Xaa₃₅ Thr Tyr Lys Cys Pro Xaa₃₆ Xaa₃₇ Xaa₃₈ Xaa₃₉ Xaa₄₀ Glu Pro Xaa₄₁ Asp Xaa₄₂ Asp Cys Trp Cys Asn Xaa₄₃ Thr Xaa₄₄ Xaa₄₅ Trp Val Xaa₄₆ Tyr Gly Thr Cys Xaa₄₇ Xaa₄₈ Xaa₄₉ Gly Glu Xaa₅₀ Arg Arg Xaa₅₁ Lys Arg Ser Val Ala Leu Xaa₅₂ Pro His Xaa₅₃ Gly Xaa₅₄ Gly Leu Xaa₅₅ Thr Arg Xaa₅₆ Xaa₅₇ Thr Trp Met Ser Xaa₅₈ Glu Gly Ala Trp Xaa₅₉ Xaa₆₀ Xaa₆ Xaa₆₂ Xaa₆₃ Xaa₆₄ Glu Xaa₆₅ Trp Xaa₆₆ Leu Arg Xaa₆₇ Pro Xaa₆₈ Phe Xaa₆₉ Xaa₇₀ Xaa₇₁ Ala Xaa₇₂ Xaa₇₃ Xaa₇₄ Ala Xaa₇₅ Xaa₇₆ Ile Gly Xaa₇₇ Xaa₇₈ Xaa₇₉ Xaa₈₀ Gln Xaa₈₁, where Xaa represents any amino acid residue. Preferred amino acid residues for the variable (Xaa) positions for such a N terminal C15/truncated prM polypeptide sequence are provided in Table 6.

TABLE 6 X₁: M R X₂: R K X₃: S T X₄: V S T A X₅: T L G I X₆: M C V T X₇: I L X₈: L M I X₉: M C X₁₀: L M X₁₁: L I X₁₂: T A X₁₃: A T V X₁₄: L M X₁₅: H S X₁₆: T S X₁₇: T S X₁₈: G D N X₁₉: T L R H X₂₀: L M X₂₁: S A G X₂₂: K R X₂₃: Q H N X₂₄: R K X₂₅: K R X₂₆: S P X₂₇: S T A E X₂₈: A E S D X₂₉: V I X₃₀: M K X₃₁: I M X₃₂: M I X₃₃: L M X₃₄: E D X₃₅: M V I X₃₆: R L H X₃₇: M L I X₃₈: T R V X₃₉: E Q N X₄₀: A N V T X₄₁: D E X₄₂: V I X₄₃: A L S X₄₄: D S X₄₅: T A X₄₆: T M X₄₇: S N T X₄₈: Q T X₄₉: T A S X₅₀: H R X₅₁: D E X₅₂: D A T V X₅₃: V S X₅₄: L M X₅₅: E D X₅₆: T A X₅₇: E Q X₅₈: S A X₅₉: K R X₆₀: H Q X₆₁: I V A X₆₂: Q E X₆₃: K R X₆₄: V I X₆₅: T S X₆₆: A I X₆₇: H N X₆₈: G R X₆₉: T I A X₇₀: V I L X₇₁: I L M X₇₂: L A G X₇₃: F I X₇₄: L M X₇₅: H Y X₇₆: A Y T M X₇₇: T Q X₇₈: T S X₇₉: I L H G X₈₀: T F I X₈₁: K R Desirably, each one of the variable positions of the above-described sequence pattern is filled by one of the above-listed amino acid residues.

In some aspects, a recombinant PRM15/tE polypeptide of the invention (e.g., such as a polypeptide having at least about 85% identity to a polypeptide sequence selected from any of SEQ ID NOS:65-116) further comprises an N terminal C15/truncated prM polypeptide comprising a sequence selected from SEQ ID NOS:117-126 and/or a C terminal E protein fragment polypeptide comprising a sequence selected from SEQ ID NOS:127-136.

In other aspects, a polypeptide of the invention, such as, e.g., a recombinant truncated E polypeptide, PRM15/truncated E polypeptide, or C15/full prM/truncated E polypeptide, desirably comprises a C terminal E protein fragment comprising an amino acid sequence that has less than 100% sequence identity (e.g., about 99%, 98%, 97%, 96%, 95%, 94%, 90%) with a C terminal E protein fragment sequence of a wild-type dengue virus E protein of one of the four serotypes (e.g., SEQ ID NOS:127-130). A polypeptide of the invention, such as, e.g., a recombinant PRM15/truncated E polypeptide or PRM15/full E polypeptide also or alternatively can comprise an N terminal C15/truncated prM polypeptide that has less than 100% amino acid sequence identity (e.g., about 99%, 98%, 97%, 96%, 95%, 94%, 90%) with an N terminal C15/truncated prM polypeptide sequence of a wild-type dengue virus prM protein (e.g., SEQ ID NOS:117-120).

In some aspects, a C15/full prM/full E polypeptide, which includes a C terminal E protein fragment that has less than 100% sequence identity with the sequence of a C terminal E protein of a WT dengue virus E protein and a C15/truncated prM polypeptide that has less than 100% sequence identity with the sequence of an N terminal C15/truncated prM polypeptide of a WT dengue virus, comprises a polypeptide sequence that has at least about 65%, at least about 70%, typically at least about 75% or at least about 80%, preferably at least about 85% (including e.g., at least about 85 to about 99.5%), and more preferably at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99% or more amino acid sequence identity to at least one of SEQ ID NOS:139-148, 236-253, 343, and 345. More preferably, such a polypeptide comprises a polypeptide sequence selected from (or having at least about 80%, about 85%, about 88%, about 90%, about 95%, 97%, about 98%, or about 99% identity with) SEQ ID NOS:139-148, 236-253, 343, and 345. Polypeptides that have at least about 80%, more preferably at least about 85%, and even more preferably at least about 90% (e.g., about 90%, 92%, 93%, 94%, or 95%) identity with SEQ ID NO:140 or SEQ ID NO:141 are a particular aspect of this invention. Preferred polypeptides comprise (or at least have about 96%, about 97%, about 98%, about 99%, or more identity with) SEQ ID NO:141.

The invention also provides for the use of a novel C terminal E protein fragment polypeptide of the invention (e.g., SEQ ID NOS:131-136) and C15/truncated prM polypeptide of the invention (e.g., SEQ ID NOS:121-126) independently of the inclusion of either in any of the above-described polypeptides. For example, polypeptides comprising or consisting essentially of at least one such novel C terminal E protein fragment polypeptide and/or at least one such novel C15/truncated prM polypeptide can be used to induce or promote an immune response to a flavivirus, such as a dengue, virus in a mammalian host; can be used in methods of diagnosis or detecting the presence of antibodies that bind to one or more dengue viruses of a biological sample; and/or can be used in the formation of dengue virus immunogens or antigens. The use of the nucleic acid sequences encoding these C terminal E protein fragment polypeptides and C15/truncated prM polypeptides, alone or in combination with other nucleic acid sequences, including those encoding PRM/tE polypeptides of the invention, also is provided.

The inventors also contemplate the use of novel PRM15-homologs (also referred to as “prM15 homologs”) of the invention (e.g., SEQ ID NOS:56-64), for example, in the formation of dengue virus immunogens or antigens, as signal peptide sequences for truncated dengue virus E protein antigens (e.g., such as SEQ ID NOS:1-49 and 153-155) or full length dengue virus E protein antigens), or as signal peptides for other viral polypeptides, such as flavivirus truncated E proteins or full length E proteins, or non-viral polypeptides. The use of the nucleic acid sequences encoding these signal peptides (e.g., portions of the polynucleotides described herein which code such amino acid sequences), alone or in combination with other nucleic acid sequences, such as, e.g., recombinant truncated E polypeptides of the invention, is also provided. In one aspect, e.g., the use of such a signal peptide nucleic acid sequence with an immunogenic nucleic acid (e.g., nucleic acid encoding any of SEQ ID NOS:1-49 and 153-155) in a DNA vaccine is contemplated.

A immunogenic polypeptide comprising or consisting essentially of an amino acid sequence that is substantially identical (e.g., having at least about 75%, 80%, 85%, 86%, 87%, 88% or 89%, preferably at least about 90%, 91%, 92%, 93%, or 94%, and more preferably at least about 95% (e.g., about 87-95%), 96% 97%, 98%, 99%, 99.5% sequence identity) to an amino acid sequence selected from the group of SEQ ID NOS:1-49 and 153-155 is at least about 400 to about 500 amino acids in length (more typically at least about 440 to about 460 amino acids in length). The lower size limit for the immunogenic amino acid sequence, and, accordingly, the polypeptide itself, typically is only dictated by the desired use of the polypeptide. In aspects where the immunogenic polypeptide is used to promote an immune response to a dengue virus, the size of the polypeptide can be similar to that of at least a truncated dengue virus envelope protein (e.g., about 440 amino acids). Alternatively, the truncated E polypeptide can be extended to a full length E protein polypeptide (having an amino acid sequence about at least as long as a WT dengue virus E protein or WT flavivirus E protein) as described herein. Alternatively, in methods wherein the polypeptide is used to induce an immune response, the polypeptide comprises a PRM15/truncated E polypeptide (e.g., about 435-465 amino acids, or a C15 signal sequence/full length prM/full length E polypeptide (e.g., about 650-680 amino acids).

Viruses and Virus-Like Particles

The invention also provides recombinant or synthetic viruses and virus-like particles (VLPs) comprising one or more of the polypeptides, nucleic acids, or vectors of the invention. Such viruses and VLPs, which may be attenuated, are useful in methods of inducing, modulating, enhancing or promoting an immune response to at least one flavivirus, preferably dengue virus, of at least one serotype as described herein. Such viruses and VLPs are useful in therapeutic and/or prophylactic methods to treat flaviviral infection (e.g., infection by one or more dengue viruses) or protect against infection by a flavivirus (e.g., infection by one or more dengue viruses). Such viruses and VLPs are useful in vaccines to safeguard against dengue viral infection and/or ADE.

In one aspect, the invention provides a virus comprising (a) a nucleic acid comprising a nucleotide sequence that has at least about 90%, 95%, or 100% sequence identity to a nucleotide sequence selected from any of SEQ ID NOS:156-218, 235, 254-271, 285-330, 342, and 344; and/or (b) a polypeptide comprising an amino acid sequence that has at least about 90%, 95%, or 100% sequence identity to a sequence selected from any of SEQ ID NOS:1-49, 65-116, 139-148, 153-155, 236-253, 343, and 345.

Also provided is a chimeric virus comprising: (a) a nucleic acid comprising a nucleotide sequence that has at least about 90, 95, or 100% sequence identity to a sequence selected from any of SEQ ID NOS:156-218, 235, 254-271, 285-330, 342, and 344, and at least one additional nucleic acid from a genome of another virus, including a flavivirus or adenovirus. The flavivirus may be a dengue virus (e.g., DEN-1, DEN-2, DEN-3, DEN-4) or yellow fever virus, Japanese encephalitis virus; Equine encephalitis virus; West Nile virus); and/or (b) a polypeptide comprising an amino acid sequence having at least about 90%, 95%, or 100% sequence identity to a sequence selected from any of SEQ ID NOS:1-49, 65-116, 139-148, 153-155, 236-253, 343, and 345 and at least one additional amino acid comprising a structural or non-structural polypeptide of a flavivirus or a fragment of a structural or non-structural polypeptide of a flavivirus, wherein said flavivirus is a dengue virus (e.g., DEN-1, DEN-2, DEN-3, or DEN-4) or is not a dengue virus (e.g., yellow fever virus, Japanese encephalitis virus; Equine encephalitis virus; or West Nile virus).

In another aspect, the invention provides a method of inducing an immune response in a host against a first flavivirus comprising: (a) providing a nucleic acid comprising a nucleotide sequence that has at least about 90%, 95%, or 100% sequence identity to a sequence selected from any of SEQ ID NOS:156-218, 235, 254-271, 285-330, 342, and 344, wherein said nucleic acid comprises a DNA sequence; (b) generating infectious RNA transcripts from the DNA sequence; (c) introducing the RNA transcripts into a cell; (d) expressing the RNA transcripts in the cell to produce virus; (e) harvesting the virus from said cell; and (g) inoculating the host with virus.

As used herein, the term “virus” includes not only complete virus particles, but also virus-like particles (VLPs) that include one or more polypeptides of the invention. A desirable feature of polypeptides comprising one of the above-described C15/full length prM/full length E polypeptides, and full length prM/truncated E polypeptides is the formation of virus-like particles (VLPs) in a mammalian host by a population of the polypeptides. VLPs lack the viral components that are required for virus replication and thus represent a highly attenuated form of a virus. A VLP of the invention can display a polypeptide (e.g., recombinant antigen) that is protective against one or more flavivirus serotypes, preferably one or more dengue virus serotypes. VLPs can display more than one type of polypeptide (e.g., recombinant antigen); e.g., a VLP can display and thus are useful as a polyvalent vaccine where antigens that are protective. In some embodiments, the methods of the invention are used to obtain VLPs that have desired characteristics or properties as described herein, including e.g., those relating to enhanced cross-protection against and/or cross-reactivity with at least two flavivirus serotypes (preferably at least two dengue virus serotypes), secretion, and/or expression. Viral proteins from several viruses are known to form VLPs, including human papillomavirus, HIV (Kang et al., Biol. Chem. 380: 353-64 (1999)), Semliki-Forest virus (Notka et al., Biol. Chem. 380: 341-52 (1999)), human polyomavirus (Goldmann et al., J. Virol. 73: 4465-9 (1999)), rotavirus (Jiang et al., Vaccine 17: 1005-13 (1999)), parvovirus (Casal, Biotechnology and Applied Biochemistry, Vol 29, Part 2, pp 141-150 (1999)), canine parvovirus (Hurtado et al., J. Virol. 70: 5422-9 (1996)), and hepatitis E virus (Li et al., J. Virol. 71: 7207-13 (1997)).

The formation of such VLPs can be detected by any suitable technique. Examples of suitable techniques known in the art for detection of VLPs in a medium include, e.g., electron microscopy techniques, dynamic light scattering (DLS), selective chromatographic separation (e.g., ion exchange, hydrophobic interaction, and/or size exclusion chromatographic separation of the VLPs) and density gradient centrifugation.

In another aspect, the invention also provides modified, mutant, synthetic or recombinant dengue viruses. Accordingly, the invention provides a modified (e.g., mutant, synthetic, or recombinant) dengue virus that comprises at least one recombinant dengue virus nucleic acid or polypeptide of the invention described herein. In one embodiment, the invention provides a modified or recombinant dengue virus produced by expression or translation of a recombinant nucleic acid of the invention in a population of a subject's cells, e.g., mammalian cells, including, e.g., mouse, primate, and/or human cells. In another embodiment, the invention provides a modified or recombinant dengue virus that comprises at least one recombinant DNA nucleotide sequence of the invention described herein. In yet another embodiment, the invention provides a modified or recombinant dengue virus produced by expression or translation of an RNA nucleic acid in a population of cells, e.g., mammalian cells, the RNA nucleic acid comprising an RNA nucleic acid sequence that comprises a modified or recombinant DNA nucleic acid sequence of the invention, wherein each thymine residue of such DNA nucleic acid sequence is replaced by a uracil residue; the invention also includes a complementary sequence of each said RNA nucleic acid sequence. In another preferred embodiment, the invention provides a modified or recombinant dengue virus comprising an RNA nucleotide sequence, said RNA nucleotide sequence comprising the isolated or recombinant DNA nucleic acid sequence of any of SEQ ID NOS:156-218, 235, 254-271, 285-330, 342, and 344, wherein each thymine residue in each said DNA nucleotide sequence is replaced by a uracil residue; an RNA sequence that is complementary to each said RNA nucleotide sequence is also provided.

Properties and Characteristics

Recombinant, synthetic, mutant, and/or isolated polypeptides of the invention exhibit a variety of properties and characteristics and are useful in a variety of contexts. In one aspect, such polypeptides are useful in methods of detecting or diagnosing of anti-flaviviral antibodies, especially anti-dengue virus antibodies in a biological sample, as described in greater detail below. In another aspect, a characteristic of the recombinant, synthetic, and/or isolated polypeptides of the invention is the promotion of an immune response to at least a portion of a dengue virus (e.g., a dengue virus antigen, a collection of dengue virus antigens, a fragment of a dengue virus, a dengue virus VLP, or an inactivated, attenuated, or virulent dengue virus of one, two, three, or even four serotypes) in an animal or population of animal cells, and preferably in a mammal, even more preferably in a human. “Promotion” encompasses any detectable increase, including induction of an immune response and increase of an already existing immune response. The polypeptides of the invention can induce a cytotoxic (or other T-cell) immune response, a humoral (antibody-mediated) immune response, or (most desirably) both, in such a host. The polypeptide can promote the production of an antibody that binds to at least a portion of a dengue virus (e.g., a particular dengue virus antigen), in a subject, such as a mammal. Desirably, the polypeptide promotes an anamnestic antibody response to a dengue virus (which can be determined by known IgG/IgM kinetics analysis techniques), and the polypeptide induces or promotes a “solid” (anamnestic and non-viremic) antibody response to a dengue virus.

More particular characteristics of immune responses attendant the administration or expression of one or more polypeptides of the invention to a subject host, such as a mammal, include the priming and stimulation of CD4+ and CD8+ lymphocytes, particularly CD8+ lymphocytes, the promotion of host cell production of anti-dengue virus IgM and/or IgG antibodies, T cell activation and cytokine release (including, but not limited to, e.g., release of one or more tumor necrosis factors (TNF) (e.g., TNF-alpha), the production of one or more interleukins (IL) (e.g., IL-1, IL-2, IL-3, IL-4, IL-5, IL-6, IL-10, IL-12), the production of one or more interferons (IFN) (e.g., IFN-gamma, IFN-alpha, EN-beta), TGF from T cells), complement activation, platelet activation, enhanced and/or decreased Th1 responses, enhanced and/or decreased Th2 responses, and humoral immunological memory.

An important characteristic of polypeptides of the invention is the promotion of an immune response to at least one dengue virus of one or more, preferably multiple, serotypes in a subject. For example, the invention provides polypeptides comprising an immunogenic amino acid sequence that induces production of one or more antibodies that bind at least one dengue virus of each of at least two virus serotypes in animals, such as a mammal.

More preferred polypeptides of the invention promote an immune response to one or more dengue virus variants of each of at least three virus serotypes of in an animal, e.g., a mammal. For example, a particular polypeptide of the invention promote the production of one or more antibodies that bind at least one dengue virus of each of at least three virus serotypes in a mammal when an antigenic or immunogenic quantity of such a polypeptide is expressed in, administered to, or delivered to the mammal.

An advantage of polypeptides of the invention is the ability to induce an immune response to at least a portion (preferably to multiple portions—e.g., multiple epitopes) of at least one dengue virus of at least one serotype in vivo and ex vivo (in contrast to polypeptides that only induce such an immune response in cell culture). Some polypeptides of the invention induce production of one or more antibodies (an antibody response) to antigens (including dengue virus Ags) of all four known dengue virus serotypes in a subject (e.g., including, but not limited to SEQ ID NOS:2, 3, 5, 25, 29, 66, 67, 69, 89, 93, 44-46, 108-110, 140, and 141).

An advantage of PRM15/tE polypeptides of the invention is the ability to induce or promote an immune response to one or more dengue viruses of one or more serotypes that is about equal to or even greater than the immune response induced or promoted against such one or dengue viruses of one or more serotypes, respectively, by a PRM15/tE polypeptide comprising a wild-type dengue virus PRM15/tE polypeptide (e.g., SEQ ID NOS:149-152). Preferably, the PRM15/tE polypeptide is capable of inducing or promoting an immune response against at least one dengue virus of at least one serotype that is at least equal to or greater than that induced or promoted against the at least one dengue virus of the at least one serotype by a wild-type dengue virus PRM15/tE antigenic polypeptide of any virus serotype (e.g., more than any of SEQ ID NOS:149-152). Preferably, the polypeptide of the invention induces an immune response in a subject to at least one dengue virus of at least two, more preferably at least three, even more preferably at least four, virus serotypes, in a mammalian cell that is equal to or greater than the immune response induced against said at least one dengue virus of at least two, at least three, at least four virus serotypes in a subject by wild-type dengue virus PRM15/tE polypeptide of the respective serotype.

Moreover, some such PRM15/tE polypeptides also or alternatively can induce or promote an immune response in vivo or ex vivo against at least one dengue virus of all four serotypes in a subject that is at least about equal to or greater than the immune response induced or promoted against at least one dengue virus of all four serotypes by the combination of wild-type dengue virus PRM15/tE antigenic polypeptides of all four serotypes (e.g., equal to or greater than the combination of SEQ ID NOS:149-152).

An advantage of C15/full prM/full E polypeptides of the invention is the ability to induce or promote an immune response to at least one dengue virus of at least one serotype that is about equal to or even greater than the immune response induced or promoted against at least one dengue virus of the at least one serotype by a wild-type dengue virus C15/full prM/full E polypeptide (e.g., SEQ ID NOS:227-230). Preferably, a C15/full prM/full E polypeptide induces an immune response in a subject to at least one dengue virus of at least two, more preferably at least three, even more preferably at least four, virus serotypes, in a subject that is equal to or greater than the immune response induced serotypes in the subject against said at least one dengue virus of at least two, at least three, at least four virus cell by wild-type dengue virus C15/full prM/full E polypeptide of the respective serotype.

In one aspect, a C15/full prM/full E is capable of inducing or promoting an immune response against at least one dengue virus of at least one serotype that is greater than that induced or promoted against the respective dengue virus serotype(s) by a C15/full prM/full E antigenic polypeptide of any serotype (e.g., >than any of SEQ ID NOS:227-230).

Moreover, some such C15/full prM/full E PRM15/tE polypeptides also or alternatively induce or promote an immune response against at least one dengue virus of at least one serotype in vivo in a subject or ex vivo in a population of cells of a subject that is at least about equal to or greater than the immune response against the at least one dengue virus of at least one serotype induced or promoted by the combination of C15/full prM/full E antigenic polypeptides of all four dengue virus serotypes (e.g., equal to or greater than the combination of SEQ ID NOS:227-230).

Some C15/full prM/full E polypeptides of the invention which form VLPs induce an immune response in a subject against at least one dengue virus of at least one serotype that is about equal to or greater than that induced against the at least one dengue virus of the at least one serotype by an incomplete wild-type dengue virus truncated capsid/full prM/full E VLP (e.g., a VLP formed from wild-type dengue virus C15/full prM/full E polypeptides), an inactivated dengue virus particle, or both. Select C15/full prM/full E polypeptides also may be able to induce an immune response to at least one dengue virus of at least one serotype a dengue virus that is at least equal to or greater than the immune response induced to at least one dengue virus of at least one serotype by an attenuated WT dengue virus in a subject.

Especially advantageous are truncated E polypeptides, PRM15/tE polypeptides, or C15/full prM/full E polypeptides of the invention that promote an immune response to at least one dengue virus of all four known serotypes that is about equal to or greater than the immune response promoted by one of the corresponding wild-type dengue virus truncated E polypeptide, PRM15/tE polypeptide (e.g., SEQ ID NOS:149-152), or C15/full prM/full E polypeptide (e.g., SEQ ID NOS:227-230) of the same serotype or a combination of such wild-type polypeptides of all four serotypes. The invention further provides a PRM15/tE polypeptide that promotes a greater immune response to one or more dengue viruses of all four serotypes than is induced by a wild-type dengue virus PRM15/tE of any serotype (e.g., induces a greater immune response than that induced by a polypeptide consisting essentially of any of SEQ ID NOS:149-152) or combination thereof. The invention also provides a C15/full prM/full E polypeptide that promotes a greater immune response to one or more dengue viruses of all four serotypes than is induced by a wild-type dengue virus C15/full prM/full E of any serotype (e.g., induces a greater immune response than that induced by a polypeptide consisting essentially of any of SEQ ID NOS:227-330) or combination thereof.

Immune responses generated or induced by the polypeptides of the invention can be measured by any suitable technique. Examples of useful techniques in assessing humoral immune responses include flow cytometry, immunoblotting (detecting membrane-bound proteins), including dot blotting, immunohistochemistry (cell or tissue staining), enzyme immunoassays, immunoprecipitation, immunohistochemistry, RIA (radioimmunoassay), and other EIAs (enzyme immunoassays), such as ELISA (enzyme-linked immunosorbent assay—including sandwich ELISA and competitive ELISA) and ELIFA (enzyme-linked immunoflow assay). ELISA assays involve the reaction of a specific first antibody with an antigen. The resulting first antibody-antigen complex is detected by using a second antibody against the first antibody; the second antibody is enzyme-labeled and an enzyme-mediated color reaction is produced by reaction with the first antibody. Suitable antibody labels for such assays include radioisotopes; enzymes, such as horseradish peroxidase (HRP) and alkaline phosphatase (AP); biotin; and fluorescent dyes, such as fluorescein or rhodamine. Both direct and indirect immunoassays can be used in this respect. HPLC and capillary electrophoresis (CE) also can be utilized in immunoassays to detect complexes of antibodies and target substances. General guidance performing such techniques and related principles are described in, e.g., Harlow and Lane (1988) ANTIBODIES, A LABORATORY MANUAL, Cold Spring Harbor Publications, New York, Hampton R et al. (1990) SEROLOGICAL METHODS A LABORATORY MANUAL, APS Press, St. Paul Minn., Stevens (1995) CLINICAL IMMUNOLOGY AND SEROLOGY: A LABORATORY PERSPECTIVE, CRC press, Bjerrum (1988) HANDBOOK OF IMMUNOBLOTTTNG OF PROTEINS, Vol. 2, Zoa (1995) DIAGNOSTIC IMMUNOPATHOLOGY: LABORATORY PRACTICE AND CLINICAL APPLICATION, Cambridge University Press, Folds (1998) CLINICAL DIAGNOSTIC IMMUNOLOGY: PROTOCOLS IN QUALITY ASSURANCE AND STANDARDIZATION, Blackwell Science Inc., Bryant (1992) LABORATORY IMMUNOLOGY & SEROLOGY 3rd edition, W B Saunders Co., and Maddox D E et al. (1983) J. Exp. Med. 158:1211. Specific guidance with respect to ELISA techniques and related principles are described in, e.g., Reen (1994) Methods Mol. Biol. 32:461-6, Goldberg et al. (1993) Curr Opin Immunol 5(2):278-81, Voller et al. (1982) Lab Res Methods Biol Med 5:59-81, Yolken et al. (1983) Ann N Y Acad Sci 420:381-90, Vaughn et al. (1999) Am J Trop Med Hvg 60(4):693-8, and Kuno et al. J Virol Methods (1991) 33(1-2):101-13. Guidance with respect to Western blot techniques can found in, e.g., Ausubel et al., CURRENT PROTOCOLS IN MOLECULAR BIOLOGY (Wiley Interscience Publishers 1995). Specific exemplary applications of Western blot techniques can be found in, e.g., Churdboonchart et al. (1990) Southeast Asian J Trop Med Public Health 21(4):614-20 and Dennis-Sykes et al. (1985) J Biol Stand 13(4):309-14. Specific guidance with respect to flow cytometry techniques is provided in, e.g., Diamond (2000) IN LIVING COLOR: PROTOCOLS IN FLOW CYTOMETRY AND CELL SORTING, Springer Verlag, Jaroszeki (1998) FLOW CYTOMETRY PROTOCOLS, 1st Ed., Shapiro (1995) PRACTICAL FLOW CYTOMETRY, 3rd edition, Rieseberg et al. (2001) Appl Microbiol Biotechnol 56(3-4):350-60, Scheffold and Kern (2000) J Clin Immunol 20(6):400-7, and McSharry (1994) Clin Microbiol Rev (4):576-604.

Briefly, a Western blot assay may be performed by attaching a recombinant dengue antigen, such as a recombinant polypeptide of the invention, to a nitrocellulose paper and staining with an antibody which has a dye attached. Among the methods using a reporter enzyme is the use of a reporter-labeled antihuman antibody. The label may be an enzyme, thus providing an enzyme-linked immunosorbent assay (ELISA). It also may be a radioactive element, thus providing a radioimmunoassay (RIA).

Cytotoxic and other T cell immune responses also can be measured by any suitable technique. Examples of such techniques include ELISpot assay (particularly, IFN-gamma ELISpot), intracellular cytokine staining (ICC) (particularly in combination with FACS analysis), CD8+ T cell tetramer staining/FACS, standard and modified T cell proliferation assays, chromium release CTL assay, limiting dilution analysis (LDA), and CTL killing assays. Guidance and principles related to T cell proliferation assays are described in, e.g., Plebanski and Burtles (1994) J Immunol Meth 170:15, Sprent et al. (2000) Philos Trans R Soc Lond B Biol Sci 355(1395):317-22, and Messele et al. (2000) Clin Diagn Lab Immunol 7(4):687-92. LDA is described in, e.g., Sharrock et al. (1990) Immunol Today 11:281-286. ELISpot assays and related principles are described in, e.g., Czerinsky et al. (1988) J Immunol Meth 110:29-36, Olsson et al. (1990) J Clin. Invest 86:981-985, Schmittel et al. (2001) J Immunol Meth 247(1-2):17-24, Ogg and McMichael (1999) Immunol Lett 66(1-3):77-80, Schmittel et al. (2001) J Immunol Meth 247(1-2):17-24, Kurane et al. (1989) J Exp Med 170(3):763-75, Chain et al. (1987) J Immunol Meth 99(2):221-8, and Czerkinsky et al. (1988) J Immunol Meth, 110:29-36, as well as U.S. Pat. Nos. 5,750,356 and 6,218,132. Tetramer assays are discussed in, e.g., Skinner et al. (2000) J Immunol 165(2):613-7. Other T cell analytical techniques are described in Hartel et al. (1999) Scand J Immunol 49(6):649-54 and Parish et al. (1983) J Immunol Meth 58(1-2):225-37.

T cell activation also can be analyzed by measuring CTL activity or expression of activation antigens such as IL-2 receptor, CD69 or HLA-DR molecules. Proliferation of purified T cells can be measured in a mixed lymphocyte culture (MLC) assay. MLC assays are known in the art. Briefly, a mixed lymphocyte reaction (MLR) is performed using irradiated PBMC as stimulator cells and allogeneic PBMC as responders. Stimulator cells are irradiated (2500 rads) and co-cultured with allogeneic PBMC (1×10⁵ cells/well) in 96-well flat-bottomed microtiter culture plates (VWR) at 1:1 ratio for a total of 5 days. During the last 8 hours of the culture period, the cells were pulsed with 1 uCi/well of ³H-thymidine, and the cells are harvested for counting onto filter paper by a cell harvester as described above. ³H-thymidine incorporation is measured by standard techniques. Proliferation of T cells in such assays is expressed as the mean cpm read for the tested wells.

ELISpot assays measure the number of T-cells secreting a specific cytokine, such as interferon-gamma or tumor necrosis factor-alpha, that serves as a marker of T-cell effectors. Cytokine-specific ELISA kits are commercially available (e.g., an IFN-gamma-specific ELISPot is available through R&D Systems, Minneapolis, Minn.). ELISpot assays are further described in the Examples section (these and other assays are described in, e.g., Examples 23-26).

Where a particular recombinant polypeptide of the invention (e.g., recombinant truncated E polypeptide, full E polypeptide, PRM15/tE polypeptide, full prM/full E polypeptide, full prM/tE polypeptide, or C15/full prM/full E polypeptide) induces an approximately equal or greater immune response to a dengue virus of a particular serotype than does a corresponding wild-type dengue virus (e.g., truncated E polypeptide, full E polypeptide, PRM15/tE polypeptide, full prM/full E polypeptide, full prM/tE polypeptide, or C15/full prM/full E polypeptide, respectively), such approximately equal or greater immune response is typically an approximately equal or greater humoral immune response. That is, e.g., at least an about equal or higher titer of neutralizing antibodies to the virus serotype is produced in response to the recombinant polypeptide than is produced by a corresponding wild-type dengue virus polypeptide (e.g., a wild-type dengue virus truncated E, full E, PRM15/tE, full prM/full E, full prM/tE, or C15/full prM/full E polypeptide). The particular recombinant truncated E, full E, PRM15/tE polypeptide, full prM/full E, full prM/tE, C15/full prM/full E polypeptide of the invention desirably induces a T cell response in mammalian cells or a mammal that is substantially similar to (e.g., is at least about 80%, at least about 85%, at least about 90%, at least about 95%, or about 100% effective as) the T cell response attendant the administration or expression of the corresponding wild-type dengue virus truncated E, full E, PRM15/tE, full prM/full E, full prM/tE, or C15/full prM/full E polypeptide in the mammalian cells or mammal.

Alternatively, where a T cell response is desired, the recombinant truncated E polypeptide or full E polypeptide, PRM15/tE polypeptide, or C15/full prM/full E polypeptide can be administered or expressed with one or more corresponding wild-type dengue virus truncated E polypeptide or full E polypeptide, PRM15/tE polypeptide, or C15/full prM/full E polypeptide dengue prM, E, and/or fragments thereof, respectively (e.g., amino acid sequences comprising known T cell epitopes of such polypeptides), in the mammal. To retain a T cell response, the recombinant polypeptide desirably comprises one or more polypeptide fragments of the capsid protein, prM protein, and E protein of wild-type DEN-1, DEN-2, DEN-3, and/or DEN-4 of at least about 8 amino acids in length, and typically about 8 to about 25 amino acids (e.g., about 10 to about 20 amino acids) in length, wherein such one or more polypeptide fragments include a T cell epitope of a wild-type dengue virus C protein, prM protein, or E protein. In one preferred aspect, a recombinant PRM15/tE polypeptide comprises T cell epitope sequences, or variants or mutants of such T cell epitope sequences, that are observed in wild-type dengue virus PRM15 peptides and/or E proteins of two or more (e.g., multiple) serotypes, and preferably three or four serotypes. In one preferred aspect, a recombinant C15/full prM/full E polypeptide comprises T cell epitope sequences, or variants or mutants of such T cell epitope sequences, that are observed in wild-type dengue virus C15 peptides, prM proteins, and/or E proteins of two or more (e.g., multiple) serotypes, and preferably three or four serotypes.

A preferred characteristic of a recombinant polypeptide of the invention (including, e.g., a tE or full E polypeptide, PRM15/tE polypeptide, or C15/full prM/full E polypeptide) is the ability to induce a neutralizing antibody response against a dengue virus in a subject. A neutralizing antibody response can be determined, e.g., by a plaque reduction neutralization titer (PRNT) assay. Any suitable PRNT assay can be used to determine whether a polypeptide (or polynucleotide expressing such a polypeptide) induces one or more neutralizing antibodies against one or more dengue viruses of one or more serotypes. An exemplary plaque reduction neutralization titer assay for dengue viruses is described in Russell et al., J Immunol (1967) 99:285-290, which is incorporated herein by reference in its entirety for all purposes. Other PRNT methods and formats are well known to those of ordinary skill in the art. The results of such an assay depend on the selected level of neutralization desired. A PRNT₅₀, for example, is the highest serum dilution tested that reduces the number of plaque forming units (p.f.u.) by at least 50%. Typically, inverse PRNT scores are reported (see the Examples section below for further details on performing and analyzing the results of such assays). Favorably, as shown herein, select recombinant polypeptides of the invention (and polynucleotides of the invention expressing such polypeptides) are capable of inducing a neutralizing antibody response against dengue viruses of at least one or at least two serotypes in a subject. Advantageously, select recombinant polypeptides of the invention (and polynucleotides of the invention expressing such polypeptides) induce a neutralizing antibody response against one or more dengue viruses of each of at least three serotypes. As shown herein, some such recombinant polypeptides of the invention induce a neutralizing antibody response against one or more dengue viruses of each of the four known wild-type dengue virus serotypes in the subject, such as a mammal.

In addition to inducing a neutralizing antibody response, a recombinant PRM15/tE polypeptide of the invention also advantageously induces the production of an equal or higher titer of neutralizing antibodies against at least one dengue virus of at least one serotype than is induced against the at least one dengue virus of the at least one serotype by at least one wild-type dengue virus PRM15/tE polypeptide of the corresponding serotype (e.g., at least one of SEQ ID NOS:149-152). Such polypeptides also or alternatively induce a higher titer of neutralizing antibodies against at least one dengue virus of at least one serotype than is induced against at least one dengue virus of at least one serotype by a combination of wild-type dengue virus PRM15/tE polypeptides of all four known serotypes (e.g., a higher titer of neutralizing antibodies than a combination of four polypeptides separately consisting essentially of SEQ ID NOS:149-152). In one aspect, the invention provides a PRM15/tE polypeptide that induces a higher titer of neutralizing antibodies against at least one dengue virus of each of the four serotypes in a subject than is induced by a WT dengue virus PRM15/tE polypeptide of any virus serotype (e.g., a higher titer than a polypeptide consisting essentially of any one of SEQ ID NOS:149-152).

The invention also provides PRM15/tE polypeptides that are capable of inducing an equal or higher titer of neutralizing antibodies against at least one dengue virus of each of at least two virus serotypes, preferably at least three virus serotypes, more preferably at least four virus serotypes, than can be induced antibodies against at least one dengue virus of each of at least two virus serotypes, preferably at least three virus serotypes, more preferably at least four virus serotypes by a WT dengue virus PRM15/tE polypeptide of each of at least two, at least three, or at least four known serotypes (e.g., at least two, three, or four of SEQ ID NOS:149-152 or a polypeptide consisting essentially of one of SEQ ID NOS:149-152).

In another aspect, in addition to inducing a neutralizing antibody response, a recombinant C15/full prM/full E polypeptide of the invention also advantageously induces the production of an equal or higher titer of neutralizing antibodies against at least one dengue virus of at least one serotype than is induced against the at least one dengue virus of the at least one serotype by at least one wild-type dengue virus C15/full prM/full E polypeptide of the corresponding serotype (e.g., at least one of SEQ ID NOS:227-230). Such C15/full prM/full E polypeptides also or alternatively induce a higher titer of neutralizing antibodies against at least one dengue virus of at least one serotype than is induced against at least one dengue virus of at least one serotype by a combination of wild-type dengue virus C15/full prM/full E polypeptides of all four known serotypes (e.g., a higher titer of neutralizing antibodies than a combination of four polypeptides separately consisting essentially of SEQ ID NOS:227-230). In one aspect, the invention provides a C15/full prM/full E polypeptide that induces a higher titer of neutralizing antibodies against at least one dengue virus of each of the four serotypes in a subject than is induced by a WT dengue virus C15/full prM/full E polypeptide of any virus serotype (e.g., a higher titer than a polypeptide consisting essentially of any one of SEQ ID NOS:227-230).

The invention also provides recombinant C15/full prM/full E polypeptides that are capable of inducing an equal or higher titer of neutralizing antibodies against at least one dengue virus of each of at least two virus serotypes, preferably at least three virus serotypes, more preferably at least four virus serotypes, than can be induced antibodies against at least one dengue virus of each of at least two virus serotypes, preferably at least three virus serotypes, more preferably at least four virus serotypes by a wild-type dengue virus C15/full prM/full E polypeptide of each of at least two, at least three, or at least four known serotypes (e.g., at least two, three, or four of SEQ ID NOS:227-230 or a polypeptide consisting essentially of one of SEQ ID NOS:227-230).

A particularly advantageous feature attendant polypeptides of the invention is the ability to induce a neutralizing antibody response to dengue viruses of all four known virus serotypes in a subject. An exceptionally beneficial attribute of PRM15tE and C15/full prM/full E polypeptides of the invention is the ability to induce a higher level of neutralizing antibodies in a subject against one or more dengue viruses of all four serotypes than the level of neutralizing antibodies induced against such one or more dengue viruses of all four serotypes by a corresponding wild-type dengue virus polypeptide (e.g., wild-type PRM15/tE or wild-type C15/full prM/full E polypeptide) of all four serotypes (e.g., SEQ ID NOS:19-152 and 227-330, respectively), or combination of such wild-type polypeptides.

A particularly desirable characteristic of recombinant polypeptides of the invention is the ability to induce a neutralizing antibody response to at least one dengue virus, more preferably to at least two dengue viruses of multiple serotypes, and most preferably to dengue viruses of all four virus serotypes, in an animal, including a vertebrate, such as, e.g., a mammal. Thus, for example, serum taken from an animal (e.g., mammal) to which an immunogenic or antigenic amount of a recombinant polypeptide of the invention was administered (or to which an amount of a recombinant polynucleotide of the invention was administered sufficient to express an immunogenic or antigenic amount of the recombinant polypeptide) or in which an immunogenic or antigenic amount of a polypeptide of the invention was expressed, diluted at least about 30 fold, preferably at least about 40 fold or at least about 50 fold, and more preferably at least about 60 fold (e.g., at least about 70 fold, 80 fold, or higher), exhibits a neutralizing antibody response against about 50% of the dengue viruses in a sample of dengue viruses of at least three virus serotypes subjected to a plaque reduction neutralization titer assay.

An “antigenic amount” is an amount of an antigen, e.g., a polypeptide antigen or polynucleotide encoding such polypeptide antigen, that is sufficient to induce, promote, enhance, or modulate an immune response or immune reaction in cells in vitro, and/or in vivo in a subject or ex vivo in a subject's cells or tissues. An antigenic amount of a polypeptide may be produced by, e.g., administration or delivery of an antigenic amount of the polypeptide itself, or by administration or delivery of a polynucleotide that encodes an antigenic amount of such polypeptide.

In another aspect, a recombinant polypeptide of the invention induces a neutralizing antibody response to at least one dengue virus of all four dengue virus serotypes in a mammal to which such recombinant polypeptide (or polynucleotide encoding such recombinant polypeptide) is administered without an occurrence of antibody-dependent enhancement (ADE) upon infection of the mammal with the dengue virus. More particularly, the invention provides recombinant polypeptides, wherein serum obtained from an animal (e.g., mammal) to which had been administered an antigenic or immunogenic amount of at least one such recombinant polypeptide, or at least one recombinant nucleic acid or vector that encodes and/or expresses such an antigenic or immunogenic amount of such recombinant polypeptide of the invention, diluted at least about 40 fold, about 50 fold, or about 60 fold, and preferably at least about 70 fold or about 80 fold or more, neutralizes at least about 30%, at least about 40%, %, at least about 45%, at least about 50%, at least about 55%, at least about 60%, or more of a sample comprising at least one dengue virus of one or more of the four virus serotypes in a plaque reduction neutralization titer assay.

In one aspect, the invention provides recombinant polypeptides that exhibit a reciprocal PRNT₅₀ score of at least about 40, about 50, about 60, or about 70 or higher (e.g., about 40 to about 100, about 40 to about 80, about 60 to about 80, or about 70 to about 100) against one or more dengue viruses of each of two or more and preferably each of all four dengue virus serotypes. In one preferred aspect, the serum from a subject comprising such a level of polypeptide (either through expression or administration), diluted at least about 20 fold, 40 fold, 60 fold, 70 fold or 80 fold, neutralizes at least about 50% of a sample of at least one flavivirus in a plaque reduction neutralization titer (PRNT) assay; in one aspect, one of such dilutions neutralizes at least about 50% of a sample of a dengue virus of each of the four virus serotypes in a PRNT assay.

In one aspect of the invention, polypeptides of the invention that induce a neutralizing antibody response to at least one dengue virus of all four virus serotypes desirably do so without exhibiting a disproportionate level of neutralizing antibody response to dengue viruses of one serotype such that the immune response or protection against one or more dengue viruses of other serotypes is compromised (i.e., masked by immunodominance). In this respect, reciprocal PRNT₅₀ scores against one or more dengue viruses of each of all four serotypes for sera obtained from a subject comprising an antigenic amount of at least one recombinant polypeptide of the invention may desirably be within a range such that the highest reciprocal PRNT₅₀ score is less than about 4× the lowest reciprocal PRNT₅₀ score (e.g., less than about 3.9×, about 3.8×, about 3.5×, about 3.4×, or about 3.3× the lowest score), less than about 3× the lowest reciprocal PRNT₅₀ (e.g., less than about 2.9×, about 2.8×, 2.5× about 2.4×, or about 2.3× the lowest score), and preferably less than about 2× the lowest reciprocal PRNT₅₀ score (e.g., less than about 1.9×, about 1.8×, about 1.7×, about 1.6×, about 1.5×, about 1.4×, about 1.3×, about 1.2×, about 1.1× the lowest score), or from about 4× or less to about 1.5× or less the lowest reciprocal PRNT₅₀ score. Also provided is a composition comprising a mixture of two or more recombinant polypeptides of two or more serotypes, wherein said composition does not show a level of immunodominance against one or more dengue viruses of serotypes not included in the composition.

The recombinant polypeptide(s) of the invention can be either a secreted or cell membrane bound (or associated) polypeptide(s). In many instances, the recombinant polypeptide desirably is a secreted polypeptide. For example, in one aspect, some secreted recombinant PRM15/tE polypeptides of the invention surprisingly are secreted more efficiently than a wild-type dengue virus PRM15/tE polypeptide of at least one virus serotype (e.g., more efficiently than a polypeptide comprising or consisting essentially of at least one of SEQ ID NOS:149-152) and more preferably more efficiently than a wild-type PRM15/tE polypeptide of any of the four virus serotypes (e.g., more efficiently than a polypeptide comprising or consisting essentially of any of SEQ ID NOS:149-152). Some secreted recombinant C15/full prM/t E polypeptides of the invention are secreted more efficiently than a WT dengue virus C15/full prM/t E polypeptide of at least one virus serotype and preferably more efficiently than a WT C15/full prM/t E polypeptide of any of the four virus serotypes.

Some secreted recombinant C15/full prM/full E polypeptides of the invention are secreted more efficiently than a WT dengue virus C15/full prM/full E polypeptide of at least one virus serotype (e.g., more efficiently than a polypeptide comprising or consisting essentially of at least one of SEQ ID NOS:227-230) and preferably more efficiently than a WT C15/full prM/full E polypeptide of any of the four virus serotypes (e.g., more efficiently than a polypeptide comprising or consisting essentially of any of SEQ ID NOS:227-230).

Analysis of polypeptide or protein secretion can be performed by any suitable technique. For example, secretion levels can be determined by comparing the results of a Western blots/immunoblots performed with the supernatant of cells transfected with polynucleotides encoding such polypeptides and similar supernatants obtained from cells transfected with polynucleotides expressing corresponding WT dengue virus, where both such recombinant and WT polypeptides are expressed from a substantially identical expression cassette (e.g., an expression cassette comprising or consisting essentially of an identical promoter, enhancer, and polyA region sequences). See, e.g., a pMaxVax10.1 vector described below. The use of such a technique to analyze protein secretion is provided in the Examples below.

Measuring the expression level of a recombinant polypeptide of the invention (or a corresponding wild-type virus polypeptide for comparative purposes) can be carried out by any suitable technique. Examples of such techniques include Northern Blot analysis (discussed in, e.g., McMaster et al. (1997) Proc Natl Acad Sci USA 74:4835-38 (1977) and Sambrook, infra), reverse transcriptase-polymerase chain reaction (RT-PCR) (as described in, e.g., U.S. Pat. No. 5,601,820 and Zaheer et al. (1995) Neurochem Res 20:1457-63, and in situ hybridization techniques (as described in, e.g., U.S. Pat. Nos. 5,750,340 and 5,506,098). Quantification of proteins also can be accomplished by the Lowry assay and other classification protein quantification assays (see, e.g., Bradford (1976) Anal Biochem 72: 248-254 and Lowry et al. (1951) J Biol Chem 193:265). Western blot analysis of recombinant polypeptides of the invention obtained from the lysate of cells transfected with polynucleotides encoding such recombinant polypeptides is a preferred technique for assessing levels of recombinant polypeptide expression. The use of such a technique to assess recombinant polypeptide expression levels (and wild-type polypeptide expression levels for comparative purposes) is provided in the Examples below.

A particularly beneficial characteristic of polypeptides of the invention, and nucleic acids encoding such polypeptides, is the ability to induce a protective immune response in a subject, such as an animal, e.g., a mammal (including a primate), against challenge with at least one dengue virus of at least one serotype. Even more favorably, a polypeptide of the invention induces a protective immune response in a subject against challenge with one or more dengue viruses of each of at least two virus, at least three, and even at least four virus serotypes.

The induction of a protective immune response is determined, for example, by the lack of a disease condition(s) or symptom in a subject upon or following infection with the at least one dengue virus of the at least one serotype. In a mouse model, for example, induction of a protective immune response protects the mice against death usually seen after injection with a dengue virus. Such mouse models are regularly used for dengue virus vaccine testing (see, e.g., Johnson and Roehrig, J. Virol. 73(1):783-6 (1999)), although higher primate testing is preferred (e.g., a rhesus monkey model). In a human, induction of a protective immune response occurs when there is a detectable lessening, and preferably a complete non-occurrence of DF and/or DHF, upon infection with a dengue virus (preferably even after repeated infection with at least one dengue virus of each of multiple virus serotypes). Typically, though not necessarily, a protective polypeptide of the invention also induces the production of neutralizing antibodies against one or more dengue viruses or multiple virus serotypes (e.g., two, three or four serotypes).

The polypeptides of the invention can comprise any suitable combination of the above-described characteristics. For example, in one aspect, the invention provides a polypeptide (e.g., recombinant truncated E polypeptide) comprising an amino acid sequence that exhibits at least about 65%, preferably at least about 75% (e.g., about 80-95%) identity to at least one amino acid sequence selected from the group of SEQ ID NOS:1-49 and 153-155, wherein the polypeptide induces production of one or more antibodies that bind to one or more dengue viruses of each of the four virus serotypes in a subject more efficiently than an antibody induced by a WT truncated E protein polypeptide of the corresponding serotype.

In another aspect, the invention provides a polypeptide (e.g., recombinant or synthetic PRM15/tE polypeptide) comprising an amino acid sequence that exhibits at least about 65%, at least about 75%, preferably at least about 85%, 90%, or 95% amino acid sequence identity to at least one sequence selected from any of SEQ ID NOS:65-116, wherein the polypeptide induces production of one or more antibodies that bind to one or more dengue viruses of each of the four serotype more efficiently than at least one antibodies induced by any WT dengue virus polypeptides selected from SEQ ID NOS:149-152.

In another aspect, the invention provides a polypeptide (e.g., recombinant C15/full prM/full E polypeptide) comprising an amino acid sequence that exhibits at least about 65%, at least about 75% (e.g., about 80-95%), and/or preferably at least about 85% or 90% identity to at least one amino acid sequence selected from the group of SEQ ID NOS:39-148, and 236-253 (or selected from the group of SEQ ID NOS:39-145, 147-148 and 236-253 or any other group comprising a combination of two or more of these polypeptides), wherein the polypeptide induces production of antibodies that bind to one or more dengue viruses of each of the four virus serotypes in a subject more efficiently than at least one antibodies induced by any WT dengue virus polypeptide selected from SEQ ID NOS:227-230.

Recombinant tE polypeptides are desirably associated or extended with an ER-targeting signal amino acid sequence, which typically has substantially sequence identity with a C-terminal flaviviral prM amino acid sequence, and preferably has substantially identity with (or is selected from) (e.g., having at least about 75%, 80%, 85%, 86%, 87%, 88% or 89%, preferably at least about 90%, 91%, 92%, 93%, or 94%, and more preferably at least about 95% (e.g., about 87-95%), 96% 97%, 98%, 99%, 99.5% sequence identity) a dengue virus PRM15 sequence or a novel homolog thereof (e.g., a sequence selected from any of SEQ ID NOS:52-64), as described above, thereby forming a signal peptide/tE polypeptide, such as, e.g., a PRM15/tE polypeptide (e.g., any of SEQ ID NOS:1-49 and 153-155). Such a recombinant tE polypeptide or PRM15/tE polypeptide can be extended with a C terminal E protein fragment polypeptide as described above, such as, e.g., a sequence substantially identical (e.g., having at least about 75%, 80%, 85%, 86%, 87%, 88% or 89%, preferably at least about 90%, 91%, 92%, 93%, or 94%, and more preferably at least about 95% (e.g., about 87-95%), 96% 97%, 98%, 99%, 99.5% sequence identity) with any of SEQ ID NOS:127-136, to form a recombinant full E polypeptide or PRM15/full E polypeptide. In addition or alternatively, a recombinant PRM15/tE polypeptide or a PRM15/full E polypeptide of the invention can be extended with an N terminal C15/truncated prM polypeptide as described above, such as, e.g., a sequence substantially identical (e.g., having at least about 75%, 80%, 85%, 86%, 87%, 88% or 89%, preferably at least about 90%, 91%, 92%, 93%, or 94%, and more preferably at least about 95% (e.g., about 87-95%), 96% 97%, 98%, 99%, 99.5% sequence identity) with any one of SEQ ID NOS:117-126, to form a recombinant C15/full prM/tE or C15/full prM/full E polypeptide, respectively. Some such recombinant polypeptides are expressed and/or secreted at higher levels than prM/E fusion proteins formed from WT DEN sequences, as shown above and in Examples below.

Some such recombinant tE, full E, PRM15/tE, PRM15/full E, C15/full prM/full E, and C15/full prM/tE polypeptides also desirably exhibit the ability to induce a neutralizing antibody response against at least one dengue virus of at least one serotype, and preferably against one or more dengue viruses of multiple serotypes, in a subject, e.g., mammalian host. As noted above, some such polypeptides induce a higher neutralizing antibody titer against at least one dengue virus of at least one serotype than is induced against such at least one dengue virus of at least one serotype by a wild-type dengue polypeptide of similar size and configuration of the same or similar serotype (e.g., wild-type dengue virus tE, full E, PRM15/tE, PRM15/full E, C15/full prM/full E, C15/full prM/tE), as described above. Some such recombinant tE, full E, PRM15/tE, PRM15/full E, C15/full prM/full E, and C15/full prM/tE polypeptides also are able to induce a protective immune response against challenge by at least one dengue virus of at least one serotype in a subject host, and preferably exhibit the ability to induce a protective immune response against challenge by at one two dengue virus of each of at least two serotypes, in a subject host. Some such recombinant tE, full E, PRM15/tE, PRM15/full E, C15/full prM/full E, and C15/full prM/tE polypeptides of the invention are able to induce a protective immune response against challenge by at least one dengue virus of each DEN-1, DEN-2, DEN-3, and DEN-4 in a subject host. Desirably, such polypeptides induce such neutralizing antibody responses and/or protective immune responses against one or more viruses of two or more dengue virus serotypes without the occurrence of ADE.

In another preferred aspect, the invention provides a recombinant tE polypeptide comprising an amino acid sequence that has at least about 65% amino acid sequence identity, at least about 75% (e.g., about 80-95%), preferably at least about 85%, or at least about 90% or at least about 95% amino acid sequence identity with at least one amino acid sequence selected from the group of SEQ ID NOS:1-49 and 153-155, wherein the polypeptide induces a neutralizing antibody response to one or more dengue viruses of at least two virus serotypes in a subject. Preferably, such polypeptides induce a neutralizing antibody response to one or more dengue viruses of each of at least three virus serotypes, and, even more preferably, against one or more dengue viruses of each of dengue-1, dengue-2, dengue-3, and dengue-4 serotypes in the subject. Optionally, such neutralizing antibody (Ab) production produces higher neutralizing Ab titers than are obtained with a corresponding wild-type truncated E of one or more virus serotypes. These polypeptides also can be associated with any of the above-described characteristics, or combinations thereof, attendant polypeptides of the invention (e.g., inclusion of an ER-targeting sequence, inclusion of at least one C terminal E protein fragment polypeptide and/or at least one N terminal C15/truncated prM polypeptide, higher secretion, higher expression, and induction of a protective immune response in a subject such as a mammal to one or more dengue viruses of multiple serotypes without induction of ADE). Such polypeptides exhibit one or more of the characteristics of the polypeptides of the invention (e.g., ability to induce an immune response against at least one dengue virus of one or more serotypes; ability to induce an immune response against at least one dengue virus of one or more serotypes that is greater than that induced by a corresponding WT dengue polypeptide; ability to induce the production of neutralizing antibodies to one or more dengue viruses of multiple serotypes in a subject) or any suitable combination thereof. Such polypeptides are useful in methods of the invention described herein, including methods of inducing an immune response against at least one dengue virus of at least one serotype, methods of inducing a protective immune response against a dengue virus, and/or methods of detecting the presence of antibodies against dengue viruses of one or more serotypes in a sample.

In another aspect of the invention, the polypeptide comprises an amino acid sequence that has substantially identity (e.g., having at least about 75%, 80%, 85%, 86%, 87%, 88% or 89%, preferably at least about 90%, 91%, 92%, 93%, or 94%, and more preferably at least about 95% (e.g., about 87-95%), 96% 97%, 98%, 99%, 99.5% sequence identity) with at least one of SEQ ID NOS:65-116. Preferred polypeptides comprise a sequence selected from any of SEQ ID NOS:65-116. Such polypeptides can comprise any of the above-described characteristics attendant polypeptides of the invention (e.g., ability to induce an immune response against at least one dengue virus of one or more serotypes; ability to induce an immune response against at least one dengue virus of one or more serotypes that is greater than that induced by a corresponding WT dengue polypeptide; ability to induce the production of neutralizing antibodies to one or more dengue viruses of multiple serotypes in a subject) or any suitable combination thereof, and are useful in methods of inducing an immune response against at least one dengue virus of at least one serotype, methods of inducing a protective immune response against a dengue virus, and/or methods of detecting or diagnosing the presence of antibodies against dengue viruses of one or more serotypes in a sample.

In yet another aspect, the invention provides a polypeptide comprising an amino acid sequence that has substantial sequence identity (e.g., at least about 65% amino acid sequence identity, desirably at least about 75% amino acid sequence identity, favorably at least about 80% or at least about 85% amino acid sequence identity, and preferably at least about 90% or at least about 95% amino acid sequence identity) with a sequence selected from any of SEQ ID NOS:139-148, 236-253, 343, and 345, or selected from any of SEQ ID NOS:139-145, 147-148, 236-253, 343, and 345. Preferred polypeptides in this aspect comprise a sequence selected from the group of SEQ ID NOS:139-148, 236-253, 343, and 345. Such polypeptides can comprise any of the above-described characteristics attendant polypeptides of the invention (e.g., ability to induce an immune response against at least one dengue virus of one or more serotypes; ability to induce an immune response against at least one dengue virus of one or more serotypes that is greater than that induced by a corresponding WT dengue polypeptide; ability to induce the production of neutralizing antibodies to one or more dengue viruses of multiple serotypes in a subject) or any suitable combination thereof, and are useful in methods of inducing an immune response against at least one dengue virus of at least one serotype, methods of inducing a protective immune response against a dengue virus, and/or methods of detecting or diagnosing the presence of antibodies against dengue viruses of one or more serotypes in a sample.

In another aspect, the invention provides a recombinant truncated E polypeptide encoded by a nucleic acid comprising a polynucleotide sequence selected from the group of: (a) a polynucleotide sequence having at least about 85% sequence identity to at least one polynucleotide sequence selected from the group of SEQ ID NOS:285-330 or a complementary polynucleotide sequence thereof; (b) a RNA polynucleotide sequence comprising a DNA sequence selected from the group of SEQ ID NOS:285-330 in which all of the thymine nucleotide residues in the DNA sequence are replaced with uracil nucleotide residues or a complementary RNA polynucleotide sequence thereof; (c) a RNA polynucleotide sequence that has at least about 85% sequence identity to at least one RNA polynucleotide sequence of (b) or a complementary RNA polynucleotide sequence thereof; (d) a polynucleotide sequence that hybridizes under at least stringent conditions over substantially the entire length of a polynucleotide sequence of (a)-(c); (e) a polynucleotide sequence which would hybridize under at least stringent conditions over substantially the entire length of a polynucleotide sequence of any of (a)-(d) but for the degeneracy of the genetic code; and (f) a polynucleotide sequence that possesses any combination of the features of the polynucleotide sequences of (a)-(e).

In yet another aspect, the invention provides a recombinant PRM15/tE polypeptide encoded by a nucleic acid comprising a polynucleotide sequence selected from the group of: (a) a polynucleotide sequence having at least about 85% sequence identity to at least one polynucleotide sequence selected from the group of SEQ ID NOS:156-200 and 235, or a complementary polynucleotide sequence thereof; (b) a RNA polynucleotide sequence comprising a DNA sequence selected from the group of SEQ ID NOS:156-200 and 235 in which all of the thymine nucleotide residues in the DNA sequence are replaced with uracil nucleotide residues or a complementary RNA polynucleotide sequence thereof; (c) a RNA polynucleotide sequence that has at least about 85%, 90%, or 95% sequence identity to at least one RNA polynucleotide sequence of (b) or a complementary RNA polynucleotide sequence thereof; (d) a polynucleotide sequence that hybridizes under at least stringent conditions over substantially the entire length of a polynucleotide sequence of (a)-(c); (e) a polynucleotide sequence which would hybridize under at least stringent conditions over substantially the entire length of a polynucleotide sequence of any of (a)-(d) but for the degeneracy of the genetic code; and (f) a polynucleotide sequence that possesses any combination of the features of the polynucleotide sequences of (a)-(e). Such polypeptides exhibit any of the above-described characteristics attendant polypeptides of the invention and are useful in methods of the invention, e.g., methods of inducing an immune response against at least one dengue virus of at least one serotype, and/or methods of detecting or diagnosing the presence of antibodies against dengue viruses of one or more serotypes in a sample.

In another aspect, the invention includes a recombinant C15/full prM/full E polypeptide encoded by a nucleic acid comprising a polynucleotide sequence selected from the group of: (a) a polynucleotide sequence having at least about 85%, 90%, or 95% sequence identity to at least one polynucleotide sequence selected from the group of SEQ ID NOS:201-210, 254-271, 342, and 344, or a complementary polynucleotide sequence thereof; (b) a RNA polynucleotide sequence comprising a DNA sequence selected from the group of SEQ ID NOS:201-210, 254-271, 342, and 344, in which all of the thymine nucleotide residues in the DNA sequence are replaced with uracil nucleotide residues or a complementary RNA polynucleotide sequence thereof; (c) a RNA polynucleotide sequence that has at least about 85% sequence identity to at least one RNA polynucleotide sequence of (b) or a complementary RNA polynucleotide sequence thereof; (d) a polynucleotide sequence that hybridizes under at least stringent conditions over substantially the entire length of a polynucleotide sequence of (a)-(c); (e) a polynucleotide sequence which would hybridize under at least stringent conditions over substantially the entire length of a polynucleotide sequence of any of (a)-(d) but for the degeneracy of the genetic code; and (f) a polynucleotide sequence that possesses any combination of the features of the polynucleotide sequences of (a)-(e). Such polypeptides exhibit any of the above-described characteristics attendant polypeptides of the invention and are useful in methods of the invention, e.g., methods of inducing an immune response against at least one dengue virus of at least one serotype, and/or methods of detecting or diagnosing the presence of antibodies against dengue viruses of one or more serotypes in a sample.

Recombinant polypeptides of the invention advantageously are capable of inducing an immune response to one or more dengue viruses of at least one, preferably at least two, more preferably at least three, and most preferably at least all four virus serotypes in a subject over sustained periods of time. For example, delivery or administration of an antigenic or immunogenic amount of at least one polypeptide of the invention to a subject induces a neutralizing antibody immune response to at least one dengue virus of at least one serotype for a period of at least about 30 days, at least about 40 days, desirably at least about 50, favorably at least about 70 or about 80 days, preferably at least about 100 days, more preferably at least about 120 days, and even more preferably at least about 180 days (e.g., about 3, 4, 6, or 9 months, about 1 year, 2 years, or longer) following delivery or administration of the at least one polypeptide to the subject.

In another aspect, delivery or administration of at least one polypeptide of the invention by delivery or administration of a suitable nucleic acid vector (e.g., a pMaxVax10.1 vector) comprising at least one polynucleotide encoding an antigenic or immunogenic amount of the at least one polypeptide, induces a neutralizing antibody immune response to at least one dengue virus of at least one serotype for a period of at least about 30 days, at least about 40 days, desirably at least about 50, favorably at least about 70 or about 80 days, preferably at least about 100 days, more preferably at least about 120 days, and even more preferably at least about 180 days (e.g., about 9 months, about 1 year, 2 years, or longer) after initial expression of the at least one polypeptide in a subject.

An immune response obtained by administration of a recombinant polypeptide of the invention (or polynucleotide or vector coding on expression for such a polypeptide, examples of which are further discussed herein), such as, e.g., a recombinant chimeric dengue virus PRM15/tE or C15/full prM/full E polypeptide, desirably lasts longer than the immune response induced by administration of a corresponding wild-type dengue virus polypeptide (e.g., wild-type dengue virus PRM15/tE or C15/full prM/full E polypeptide) or polynucleotide or vector coding on expression of such a wild-type dengue virus polypeptide.

In another aspect, the invention provides a recombinant or chimeric polypeptide including at least about 3, least about 5, at least about 6, at least about 7, at least about 8, at least about 10, at least about 15, at least about 20, at least about 25, at least about 30 or more amino acid (residue) sequence fragments of at least about 8 amino acid residues in length, at least about 10 amino acid residues in length, or at least about 15 amino acids in length, wherein such amino acid fragments are observed in one or more wild-type DEN-1, DEN-2, DEN-3, and DEN-4 prM/E amino acid sequences, and wherein such polypeptide comprises at least about 2 (preferably at least about 3, at least about 4, at least about 5, at least about 10, at least about 15, or more) non-contiguous fragments from any one wild-type dengue virus prM/E sequence. Such recombinant or chimeric polypeptides also or alternatively can promote or enhance an immune response, especially in the case of a humoral immune response, about equal to or greater than a recombinant or chimeric dengue virus antigen and/or flavivirus antigen consisting of 4 or less distinct dengue virus and/or flavivirus amino acid sequence fragments.

In one aspect, for example, the invention provides a recombinant or chimeric polypeptide comprising at least about 5 to about 20 amino acid sequence fragments comprising at least about 10 to about 40 amino acid residues in length, wherein such amino acid sequence fragments are non-contiguous fragments of one or more wild-type DEN-1, DEN-2, DEN-3, and DEN-4 prM/E amino acid sequences.

Nucleic Acids

The invention further provides a nucleic acid comprising a nucleotide sequence encoding any of the above-described polypeptides of the invention, including recombinant and chimeric tE, full length E, PRM15/tE, PRM15/full E, C15/full prM/full E, and C15/full prM/tE polypeptides of the invention. The terms “nucleic acid” and “polynucleotide” are synonymously used throughout in reference to such recombinant dengue antigen-encoding DNA, RNA, or other novel nucleic acid molecules of the invention, unless otherwise stated or clearly contradicted by context. The nucleic acid encoding a recombinant, synthetic, mutant, and/or isolated polypeptide of the invention can be any type of nucleic acid suitable for expressing a polypeptide of the invention (e.g., single stranded or double stranded RNA, DNA, or combinations thereof) and can include any suitable nucleotide base, base analog, and/or backbone (e.g., a backbone formed by, or including, a phosphothioate, rather than phosphodiester, linkage). Modifications to the nucleic acid are particularly tolerable in the 3rd position of an mRNA codon sequence encoding such a polypeptide. Examples of modified nucleotides that can be incorporated in the polynucleotide sequence are provided in, e.g., the MANUAL OF PATENT EXAMINING PROCEDURE §2422 (7th Revision—2000). Additional and alternative sequence modifications are described elsewhere herein. In some aspects, a nucleic acid of the invention may be an isolated nucleic acid. A nucleic acid of the invention may be termed a recombinant, synthetic, and/or mutant nucleic acid or polynucleotide; a recombinant, synthetic, and/or mutant nucleic acid or polynucleotide of the invention is often simply referred to as a “recombinant nucleic acid,” “recombinant polynucleotide,” or even more simply a “nucleic acid” or “polynucleotide.” A nucleic acid of the invention may be referred to as a recombinant or synthetic dengue antigen-encoding nucleic acid.

In another aspect, a recombinant nucleic acid of the invention comprises any nucleotide sequence that results in the production (e.g., suitable production) of a recombinant polypeptide of the invention upon expression or translation in a desired host cell. Suitable production of a recombinant polypeptide of the invention may be the expression of an immunogenic amount or antigenic amount of the polypeptide of the invention. An “immunogen” is a molecule that induces, promotes, enhances, or modulates an immune response (e.g., in an in vitro cell-based assay or in vivo in a subject to which the immunogen is administered or ex vivo in cells (e.g., a subject's cells or tissue) that are transplanted into the subject). In one particular aspect, an “immunogen” is an antigen that elicits an immune response. In certain particular aspects, an immunogen is an antigen that elicits a strong immune response, particularly in the context of protective immunity to at least one pathogen (e.g., at least one flavivirus of at least one serotype). An “immunogenic amount” is typically an amount of a molecule, e.g., a polypeptide or polynucleotide, that is sufficient to induce, promote, enhance, or modulate an immune response in cells in vitro, and/or in vivo in a subject or ex vivo in a subject's cells or tissue(s). In one aspect, an immunogenic amount refers to an amount of an antigen that elicits an immune response that provides at least partial or complete protective immunity against at least one pathogen, such as a flavivirus (e.g., such as against at least one dengue virus of one or more serotypes). An immunogenic amount of a polypeptide may be produced by, e.g., administration or delivery of an immunogenic amount of the polypeptide itself to cells in vitro or to a population of cells or tissue(s) of a subject ex vivo or in vivo, or by administration or delivery of a nucleic acid that encodes an immunogenic amount of such polypeptide to cells in vitro or to a population of cells or tissue(s) of a subject ex vivo or in vivo.

Nucleic acids of the invention encode polypeptides of the invention that have one or more of the properties described above. Nucleic acids of the invention are similarly useful in methods of the invention, including a method of inducing an immune response against at least one flavivirus of at least one serotype (e.g., preferably, dengue virus) in a subject that comprises contacting, administering, or delivering a nucleic acid encoding at least one polypeptide of the invention to such subject (or to population of cells of the subject) and methods of inducing a protective immune response against at least one flavivirus of one or more serotypes (e.g., preferably, at least one dengue virus of two or more serotypes) in a subject that comprises contacting, administering, or delivering a nucleic acid encoding at least one polypeptide of the invention to such subject (or to a population of cells thereof).

As such, a nucleic acid of the invention is not limited to a nucleotide sequence that directly codes for expression or production of a polypeptide of the invention. For example, the nucleic acid can comprise a nucleotide sequence which results in a polypeptide of the invention through intein-like expression (as described in, e.g., Colson and Davis (1994) Mol Microbiol 12(3): 959-63, Duan et al. (1997) Cell 89(4):555-64, Perler (1998) Cell 92(1):1-4, Evans et al (1999) Biopolymers 51(5):333-42, and de Grey, Trends Biotechnol (2000) 18(9):394-99), or a nucleotide sequence which comprises self-splicing introns (or other self-spliced RNA transcripts), which form an intermediate recombinant polypeptide-encoding sequence (as described in, e.g., U.S. Pat. No. 6,010,884). The polynucleotides also can comprise sequences which result in other splice modifications at the RNA level to produce an mRNA transcript encoding the polypeptide and/or at the DNA level by way of trans-splicing mechanisms prior to transcription (principles related to such mechanisms are described in, e.g., Chabot, Trends Genet (1996) 12(11):472-78, Cooper (1997) Am J Hum Genet 61(2):259-66, and Hertel et al. (1997) Curr Opin Cell Biol 9(3):350-57). Due to the inherent degeneracy of the genetic code, several nucleic acids can code for any particularly polypeptide of the invention. Thus, for example, any of the particular recombinant dengue-antigen-encoding nucleic acids described herein can be modified by replacement of one or more codons with an equivalent codon (with respect to the amino acid called for by the codon) based on genetic code degeneracy.

Polynucleotides of the invention can be obtained by application of any suitable synthesis, manipulation, and/or isolation techniques, or combinations thereof. For example, polynucleotides of the invention are typically and preferably produced through standard nucleic acid synthesis techniques, such as solid-phase synthesis techniques known in the art. In such techniques, fragments of up to about 100 bases usually are individually synthesized, then joined (e.g., by enzymatic or chemical ligation methods, or polymerase mediated recombination methods) to form essentially any desired continuous nucleic acid sequence. The synthesis of the nucleic acids of the invention can be also facilitated (or alternatively accomplished), by chemical synthesis using, e.g., the classical phosphoramidite method, which is described in, e.g., Beaucage et al. (1981) Tetrahedron Letters 22:1859-69, or the method described by Matthes et al. (1984) EMBO J 3:801-05, e.g., as is typically practiced in automated synthetic methods. The polynucleotide of the invention desirably by the phosphoramidite technique also preferably is accomplished by way of an automatic DNA synthesizer. Other techniques for synthesizing nucleic acids and related principles are described in, e.g., Itakura et al. (1984) Annu Rev Biochem 53:323, Itakura et al. (1984) Science 198:1056, and Ike et al. (1983) Nucl Acid Res 11:477.

Conveniently, custom made nucleic acids can be ordered from a variety of commercial sources, such as The Midland Certified Reagent Company (“mcrc” at “oligos” dot “com”), the Great American Gene Company (“http” colon, slash, slash, “www” dot “genco” dot “com”), ExpressGen Inc. (“www” dot “expressgen” dot “com”), Operon Technologies Inc. (Alameda, Calif.). Similarly, custom peptides and antibodies can be custom ordered from any of a variety of sources, e.g., PeptidoGenic (“pkim” at “ccnet” dot “com”), HTI Bio-products, Inc. (“http” colon, slash, slash, “htibio” dot “com”), and BMA Biomedicals Ltd. (U.K.), Bio. Synthesis, Inc.

Recombinant DNA techniques useful in modification of nucleic acids are well known in the art (e.g., restriction endonuclease digestion, ligation, reverse transcription and cDNA production, and PCR). Useful recombinant DNA technology techniques and principles related thereto are provided in, e.g., Mulligan (1993) Science 260:926-932, Friedman (1991) THERAPY FOR GENETIC DISEASES, Oxford University Press, Ibanez et al. (1991) EMBO J 10:2105-10, Ibanez et al. (1992) Cell 69: 329-41 (1992), and U.S. Pat. Nos. 4,440,859, 4,530,901, 4,582,800, 4,677,063, 4,678,751, 4,704,362, 4,710,463, 4,757,006, 4,766,075, and 4,810,648, and are more particularly described in Sambrook et al. (1989) MOLECULAR CLONING: A LABORATORY MANUAL, Cold Spring Harbor Press, and the third edition thereof (2001), Ausubel et al. (1994-1999), CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, Wiley Interscience Publishers (with Greene Publishing Associates for some editions), Berger and Kimmel, “Guide to Molecular Cloning Techniques” in Meth Enzymol, 152, Acad. Press, Inc. (San Diego, Calif.), and Watson et al., RECOMBINANT DNA (2d ed.).

As described in more detail herein, recombinant nucleic acids of the invention include: (1) polynucleotide sequences that encode any of the above-described polypeptide sequences and immunogenic or antigenic fragments thereof (including, e.g., recombinant immunogenic tE, full E, PRM15/tE, PRM15/full E, C15/full prM/full E, and C15/full prM/tE polypeptides of the invention), and complementary sequences thereof; (2) polynucleotide sequences complementary to such polynucleotide sequences of (1) (and fragments thereof); polynucleotides that hybridize under at least stringent conditions to the nucleic acid sequences defined herein, including any polynucleotide sequence of (1) and (2), and complementary sequences thereof; (3) novel immunogenic or antigenic fragments of all such nucleic acid sequences, including those described in (1) and (2) above, which have one or more of the immune-response inducing properties set forth herein or encode one or more polypeptides have one or more of the immune-response inducing properties set forth herein, and complementary sequences of such fragments; and (4) variants, analogs, and homologue derivatives of all of the above.

The polynucleotides of the invention can be double-stranded or single-stranded, and if single-stranded, can be the coding strand or the non-coding (i.e., antisense or complementary) strand. In addition to a nucleotide sequence encoding a polypeptide of the invention, the polynucleotide of the invention can comprise one or more additional coding nucleotide sequences, so as to encode, e.g., a fusion protein, a pre-protein, a prepro-protein, a heterologous transmembrane domain, targeting sequence (other than a signal sequence), or the like (more particular examples of which are discussed further herein), and/or can comprise non-coding nucleotide sequences, such as introns, or 5′ and/or 3′ untranslated regions effective for expression of the coding sequence in a suitable host.

For example, a polynucleotide of the invention that encodes a truncated E or full E protein (i.e., tE-encoding or full E-encoding polynucleotide) may further comprise a nucleotide sequence that encodes an ER-targeting signal sequence positioned near to, or fused to, the tE-encoding or full E-encoding immunogenic sequence of the invention. Preferably, the signal sequence-encoding nucleic acid sequence encodes a signal sequence having the preferred features of signal sequences described above (e.g., a 15-amino acid fragment of the C terminal of a prM protein of a flavivirus, preferably a dengue virus, or a sequence having substantial identity therewith).

In one aspect, the invention provides a nucleic acid comprising a signal sequence-encoding sequence having substantial nucleotide sequence identity (e.g., at least about 65%, 70%, 75%, preferably at least about 80% or 85%, and more preferably at least about 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity) with at least one of SEQ ID NOS:272-284, and preferably SEQ ID NOS:272-280. Such polynucleotide sequences, which encode signal peptides, are similarly useful as and in applications including the signal peptides of the invention discussed above. In one aspect, truncated E polypeptide-encoding or full length E polypeptide-encoding nucleic acids comprising a sequence selected from the group of SEQ ID NOS:272-284 are also a feature of the present invention.

The invention also provides novel nucleic acids useful in the production of recombinant dengue virus antigens and other applications (e.g., for use in methods of inducing an immune response against one or more dengue viruses and/or in therapeutic or prophylactic methods, as vaccines, in diagnostic methods and systems, as nucleic acid probes, in the amplification of smaller nucleic acid sequences that encode immunogenic fragments of such recombinant dengue virus antigens (such uses are discussed elsewhere herein)).

For example, in one respect, the invention provides a nucleic acid comprising a polynucleotide sequence that has substantial sequence identity (e.g., at least about 65%, 70%, 75%, preferably at least about 80% or 85%, and more preferably at least about 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity) with the polypeptide sequence of at least one of SEQ ID NOS:285-330. In particular aspects, the nucleic acid comprises a sequence selected from the group of SEQ ID NOS:285-330. Such a nucleic acid encodes at least a recombinant truncated E polypeptide of the invention and is generally termed a recombinant tE polypeptide-encoding nucleic acid. Such nucleic acids having one or more of the properties of recombinant nucleic acids described below.

Nucleic acids consisting of and/or consisting essentially of such sequences such as the group of SEQ ID NOS:285-330 encode a polypeptide of a length approximately equal to a truncated E sized recombinant dengue antigen of the invention, as discussed above. Such nucleic acids are typically at least about 1300 nucleotides in length, and typically are about 1300-1375 nucleotides in length (e.g., about 1340 nucleotides in length).

The invention also provides a nucleic acid comprising first nucleotide sequence encoding recombinant truncated E polypeptide dengue antigens and a second nucleotide sequence encoding a signal peptide. For example, in one aspect, the invention provides a nucleic acid comprising a sequence that has substantial identity (e.g., at least about 75%, 80%, 85%, 86%, 87%, 88% or 89%, preferably at least about 90%, 91%, 92%, 93%, or 94%, and more preferably at least about 95% (e.g., about 87-95%), 96% 97%, 98%, 99%, 99.5% sequence identity) with at least one of SEQ ID NOS:156-200 and 235. More desirably, the polynucleotide comprises a sequence selected from the group of SEQ ID NOS:156-200 and 235. Such nucleic acids are typically at least about 1350 nucleotides in length, and more typically are about 1350-1400 nucleotides in length (e.g., about 1385 nucleotides in length). Such a nucleic acid encodes a PRM15/t E polypeptide and is generally termed a PRM15/tE polypeptide-encoding nucleic acid. In particular aspects, the nucleic acid comprises a sequence selected from the group of SEQ ID NOS:157-159, 185, 187, 172, 200, and 235. Such nucleic acids having one or more of the properties of recombinant nucleic acids are described below.

The invention also provides a nucleic acid sequence that has substantial sequence identity (e.g., at least about 65%, 70%, 75%, preferably at least about 80% or 85%, and more preferably at least about 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity) with the polypeptide sequence of at least one of SEQ ID NOS:211-214. In particular aspects, the nucleic acid comprises a sequence selected from the group of SEQ ID NOS:211-214. These nucleotide sequences are human codon optimized nucleotide sequences encoding DEN-1, DEN-2, DEN-3, and DEN-4 PRM15/tE polypeptides, respectively, and are generally termed human CO WT PRM15/tE polypeptide-encoding nucleic acids. The polypeptides encoded by these nucleic acid sequences offer improved biological properties over non human codon-optimized (CO) wild-type PRM15/tE polypeptide-encoding nucleotide sequences. Such polynucleotides are markedly different in structure and lack substantial identity with non human CO wild-type PRM15/tE polypeptide-encoding nucleotide sequences. For example, the polypeptide encoded by these sequences are expressed at higher levels and/or are secreted at higher levels than similar PRM15/tE polypeptides expressed from non human CO WT dengue virus PRM15/tE sequences.

Also included is a nucleic acid sequence that has substantial sequence identity (e.g., at least about 75%, 80%, 85%, 86%, 87%, 88% or 89%, preferably at least about 90%, 91%, 92%, 93%, or 94%, and more preferably at least about 95% (e.g., about 87-95%), 96% 97%, 98%, 99%, 99.5% sequence identity) with the polypeptide sequence of at least one of SEQ ID NOS:215-218. Some such nucleic acid comprises a sequence of at least about 1800 nucleotides. In particular aspects, the nucleic acid comprises a sequence selected from the group of SEQ ID NOS:215-218. These nucleic acid sequences are human codon optimized nucleotide sequences that encode DEN-1, DEN-2, DEN-3, and DEN-4 C15/full prM/full E polypeptides, respectively, and are termed human CO WT C15/full prM/full E polypeptide-encoding nucleic acids. These encoded polypeptides exhibit improved biological properties compared to C15/full prM/full E polypeptides expressed from non human CO wild-type C15/full prM/full E polypeptide-encoding nucleic acids. For example, the polypeptides encoded by these sequences are expressed at higher levels and/or are secreted at higher levels than similar C15/full prM/full E polypeptides expressed from non human CO wild-type C15/full prM/full E polypeptide-encoding nucleic acid sequences.

The invention also provides a recombinant nucleic acids that encode, e.g., antigenic fusion proteins each comprising: (1) a C15 dengue virus signal sequence (which also includes a Met residue as the first residue of the signal sequence, thereby forming a 16-amino acid signal sequence); (2) a full length prM dengue virus sequence; and 3) a full length envelope (E) protein sequence, wherein these three sequences are fused together in the order 1, 2, and 3. In another aspect, the invention provides recombinant nucleic acids encoding antigenic fusion proteins that each comprise a full length prM dengue virus sequence fused to a full length envelope (E) protein sequence.

In another aspect, the invention provides a nucleic acid comprising at least a first polynucleotide sequence comprising a tE polypeptide-encoding polynucleotide sequence or a PRM15/tE polypeptide-encoding polynucleotide sequence and at least a second polynucleotide sequence that encodes a polypeptide sequence that has at least about 55%, preferably at least about 65%, and more preferably at least about 75% (e.g., at least about 80%, at least about 85%, at least about 90%, at least about 95%, or more) amino acid sequence identity with a sequence selected from the group of SEQ ID NOS:127-136. Alternatively, the second polynucleotide sequence that has at least about 55%, preferably at least about 65%, and more preferably at least about 75% (e.g., at least about 80%, at least about 85%, at least about 90%, at least about 95%, or more) nucleotide identity with a sequence selected from the group of SEQ ID NOS:223-226. In some aspects, it is desirable that such nucleic acids comprise a sequence selected from the group of SEQ ID NOS:223-226.

In another aspect, the invention provides a nucleic acid comprising a polynucleotide sequence that has substantial sequence identity (e.g., at least about 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity) with the polypeptide sequence of at least one of SEQ ID NOS:201-210, 254-271, 342, and 344. In particular aspects, the nucleic acid comprises a sequence selected from the group of SEQ ID NOS:201-210, 254-271342, and 344. Such a nucleic acid encodes a recombinant C15/full length prM/full length E polypeptide of the invention and is generally termed a recombinant C15/full prM/full E polypeptide-encoding nucleic acid. Such nucleic acids having one or more properties of the recombinant nucleic acids are described below.

The invention also provides nucleic acids that hybridize with any of the disclosed and/or above-described nucleic acid sequences of the invention under at least moderately stringent hybridization conditions, at least stringent hybridization conditions, at least highly stringent hybridization conditions, or preferably very stringent hybridization conditions over substantially the entire length of a nucleic acid. “Substantially the entire length of a nucleic acid sequence” refers to at least about 50%, generally at least about 60%, at least about 70%, or at least about 75%, usually at least about 80%, at least about 85%, at least about 88%, and typically at least about 90%, e.g., at least about 91%, about 92%, about 93%, about 94%, about 95%, about 96%, about 97%, about 98%, about 99%, about 99.5%, or more, of the length of the nucleic acid sequence. Thus, the invention provides a polynucleotide that comprises a nucleic acid sequence (a test sequence) that hybridizes to at least about 50%, preferably at least about 65%, and more preferably at least about 80% of a reference sequence (e.g., a nucleic acid sequence disclosed herein, such as, for example, a sequence selected from the group of SEQ ID NOS:156-218, 235, 254-271, 285-330, 342, and 344). More preferably, the hybridizing nucleic acid hybridizes to the disclosed nucleic acid sequence (e.g., a sequence selected from said above-referenced SEQ ID NOS) under at least stringent conditions, and, even more preferably under at least high stringency conditions. Moderately stringent, stringent, and highly stringent hybridization conditions for nucleic acid hybridization experiments are known in the art. As such, only examples of the factors that can be combined to achieve such levels of stringency are briefly discussed herein.

Exemplary moderate stringency conditions include overnight incubation at 37° C. in a solution comprising 20% formalin (or formamide), 0.5×SSC, 50 mM sodium phosphate (pH 7.6), 5×Denhardt's solution, 10% dextran sulfate, and 20 mg/mL denatured sheared salmon sperm DNA, followed by washing the filters in 1×SSC at about 37-50° C., or substantially similar conditions, e.g., the moderately stringent conditions described in Sambrook et al., supra, and/or Ausubel, supra.

Exemplary stringent (or regular stringency) conditions for analysis of nucleic acids comprising at least 100 nucleotides include incubation in a solution comprising 50% formalin (or formamide) with 1 mg of heparin at 42° C., with the hybridization being carried out overnight. A regular stringency wash can be carried out using, e.g., a solution comprising 0.2×SSC wash at about 65° C. for about 15 minutes (see Sambrook, supra, for a description of SSC buffer). Often, the regular stringency wash is preceded by a low stringency wash to remove background probe signal. A low stringency wash can be carried out in, for example, a solution comprising 2×SSC at about 40° C. for about 15 minutes. A highly stringent wash can be carried out using a solution comprising 0.15 M NaCl at about 72° C. for about 15 minutes. An example medium (regular) stringency wash, less stringent than the regular stringency wash described above, for a duplex of, e.g., more than 100 nucleotides, can be carried out in a solution comprising 1×SSC at 45° C. for 15 minutes. An example low stringency wash for a duplex of, e.g., more than 100 nucleotides, is carried out in a solution of 4-6×SSC at 40° C. for 15 minutes. For short probes (e.g., about 10 to 50 nucleotides), stringent conditions typically involve salt concentrations of less than about 1.0 M Na⁺ ion, typically about 0.01 to 1.0 M Na⁺ ion concentration (or other salts) at pH 7.0 to 8.3, and the temperature is typically at least about 30° C. Stringent conditions can also be achieved with the addition of destabilizing agents such as formamide.

High stringency conditions are conditions that use, for example, (1) low ionic strength and high temperature for washing, such as 0.015 M sodium chloride/0.0015 M sodium citrate/0.1% sodium dodecyl sulfate (SDS) at 50° C., (2) employ a denaturing agent during hybridization, such as formamide, e.g., 50% (v/v) formamide with 0.1% BSA/0.1% Ficoll/0.1% polyvinylpyrrolidone (PVP)/50 mM sodium phosphate buffer at pH 6.5 with 750 mM sodium chloride, 75 mM sodium citrate at 42° C., or (3) employ 50% formamide, 5×SSC (0.75 M NaCl, 0.075 M sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5×Denhardt's solution, sonicated salmon sperm DNA (50 μg/mL), 0.1% SDS, and 10% dextran sulfate at 42° C., with washes at (i) 42° C. in 0.2×SSC, (ii) at 55° C. in 50% formamide and (iii) at 55° C. in 0.1×SSC (preferably in combination with EDTA).

More generally or alternatively, high stringency conditions are selected such that hybridization occurs at about 5° C. or less than the thermal melting point (T_(m)) for the specific sequence at a defined ionic strength and pH. The T_(m) is the temperature (under defined ionic strength and pH) at which 50% of the test sequence hybridizes to a perfectly matched probe. In other words, the T_(m) indicates the temperature at which the nucleic acid duplex is 50% denatured under the given conditions and its represents a direct measure of the stability of the nucleic acid hybrid. Thus, the T_(m) corresponds to the temperature corresponding to the midpoint in transition from helix to random coil; it depends on length, nucleotide composition, and ionic strength for long stretches of nucleotides. Typically, under “stringent conditions,” a probe will hybridize to its target subsequence, but to no other sequences. “Very stringent conditions” are selected to be equal to the T_(m) for a particular probe.

The T_(m) of a DNA-DNA duplex can be estimated using equation (1): T_(m)(° C.)=81.5° C.+16.6(log₁₀M)+0.41(% G+C)−0.72(% f)−500/n, where M is the molarity of the monovalent cations (usually Na⁺), (% G+C) is the percentage of guanosine (G) and cytosine (C) nucleotides, (% f) is the percentage of formalize and n is the number of nucleotide bases (i.e., length) of the hybrid. See Rapley and Walker, MOLECULAR BIOMETHODS HANDBOOK supra. The T_(m) of an RNA-DNA duplex can be estimated using equation (2): T_(m) (° C.)=79.8° C.+18.5 (log₁₀M)+0.58 (% G+C)−11.8(% G+C)²−0.56(% f)−820/n, where M is the molarity of the monovalent cations (usually Na⁺), (% G+C) is the percentage of guanosine (G) and cytosine (C) nucleotides, (% f) is the percentage of formamide and n is the number of nucleotide bases (i.e., length) of the hybrid. Id. Equations 1 and 2 above are typically accurate only for hybrid duplexes longer than about 100-200 nucleotides. Id. The Tm of nucleic acid sequences shorter than 50 nucleotides can be calculated as follows: T_(m)(° C.)=4(G+C)+2(A+T), where A (adenine), C, T (thymine), and G are the numbers of the corresponding nucleotides.

In general, unhybridized nucleic acid material desirably is removed by a series of washes, the stringency of which can be adjusted depending upon the desired results, in conducting hybridization analysis. Low stringency washing conditions (e.g., using higher salt and lower temperature) increase sensitivity, but can product nonspecific hybridization signals and high background signals. Higher stringency conditions (e.g., using lower salt and higher temperature that is closer to the hybridization temperature) lower the background signal, typically with only the specific signal remaining. Addition useful guidance concerning such hybridization techniques is provided in, e.g., Rapley and Walker, MOLECULAR BIOMETHODS HANDBOOK, supra (in particular, with respect to such hybridization experiments, part I, chapter 2, “Overview of principles of hybridization and the strategy of nucleic acid probe assays”), Elsevier, New York, as well as in Ausubel, supra, Sambrook et al., supra, Watson et al., supra, Hames and Higgins (1995) GENE PROBES 1, IRL Press at Oxford University Press, Oxford, England, and Hames and Higgins (1995) GENE PROBES 2, IRL Press at Oxford University Press, Oxford, England.

Preferably, the hybridization analysis is carried out under hybridization conditions selected such that a perfectly complementary oligonucleotide to the recombinant dengue antigen-encoding sequence or otherwise disclosed sequence hybridizes with the recombinant dengue antigen-encoding sequence with at least about 2 times (2×) (e.g., about 2.5 times), desirably at least about 5 times, preferably at least about 7 times, and more preferably at least about 10 times, higher signal-to-noise ratio than for hybridization of the perfectly complementary oligonucleotide to a control nucleic acid comprising a polynucleotide sequence that is at least about 90% identical to a polynucleotide sequence that encodes approximately the same length WT dengue virus polypeptide or encodes a corresponding WT dengue virus polypeptide. For example, if the recombinant sequence is a PRM15/tE-encoding polynucleotide sequence, a control nucleic acid may be a wild-type PRM15/tE-encoding polynucleotide that encodes a DEN-1 PRM15/tE, DEN-2 PRM15/tE, DEN-3 PRM15/tE, or DEN-4 PRM15/tE polypeptide or the control nucleic acid may comprises a polynucleotide sequence that is at least about 90% identity with the nucleotide sequence of any of: SEQ ID NO:231 (DEN-1 PRM15/E trunc WT cDNA sequence comprising nucleic acid residues 894-2285 of GenBank Acc. No. AB074761); SEQ ID NO:232 (DEN-2 PRM15/E trunc WT cDNA sequence comprising nucleic acid residues 892-2274 of GenBank Acc. No. NC_(—)001474); SEQ ID NO:233 (DEN-3 PRM15/E trunc WT cDNA sequence comprising nucleic acid residues 893-2263 of GenBank Acc. No. M25277); and SEQ ID NO:234 (DEN-4 PRM15/E trunc WT cDNA sequence comprising nucleic acid residues 894-2285 of GenBank Acc. No. M14931). Such conditions can be considered indicative for specific hybridization.

The above-described hybridization conditions can be adjusted, or alternative hybridization conditions selected, to achieve any desired level of stringency in selection of a hybridizing nucleic acid sequence. For example, the above-described highly stringent hybridization and wash conditions can be gradually increased (e.g., by increasing temperature, decreasing salt concentration, increasing detergent concentration and/or increasing the concentration of organic solvents, such as formalin, in the hybridization or wash), until a selected set of criteria are met. For example, the hybridization and wash conditions can be gradually increased until a desired probe, binds to a perfectly matched complementary target, with a signal-to-noise ratio that is at least about 2.5×, and optionally at least about 5× (e.g., about 10×, about 20×, about 50×, about 100×, or even about 500×), as high as the signal-to-noise ration observed from hybridization of the probe to a nucleic acid not of the invention, such as a known nucleic acid sequence selected from GenBank, and/or a wild-type dengue virus nucleic acid sequence, including, e.g., a wild-type dengue virus nucleic acid encoding a wild-type dengue virus truncated E polypeptide, full E polypeptide, PRM15/tE polypeptide, PRM15/full E polypeptide, or C15/full prM/full E polypeptide. In hybridization analyses, a recombinant polypeptide of the invention is typically compared with or analyzed in view of a wild-type polypeptide of approximately the same length or comprising the same or similar format (e.g., a recombinant C15/full prM/full E polypeptide of the invention is compared with a wild-type flavivirus C15/full prM/full E polypeptide).

In one aspect, the invention provides a nucleic acid comprising a sequence of at least about 900 nucleotides (including, e.g., at least about 900-3000 nucleotides, at least about 1000-2000 nucleotides), preferably at least about 1200 nucleotides, and more preferably at least about 1300 nucleotides, that encodes an amino acid sequence that induces an immune response to a dengue virus in a subject, wherein the nucleic acid selectively hybridizes under at least moderately stringent conditions to at least one of SEQ ID NOS:156-200 and 235 as compared to a wild-type PRM15/tE-encoding sequence (e.g., SEQ ID NOS:231-234) or a portion or fragment thereof. In one aspect, for example, the nucleic acid of the invention preferably hybridizes to at least one of SEQ ID NOS:156-200 and 235 as compared to a WT PRM15/truncated E-encoding sequence from DEN-3 (e.g., nucleotides 893-2263 of the nucleotide sequence recorded under GenBank Accession No. M25277). Preferably, the nucleic acid selectively hybridizes to at least one of SEQ ID NOS:156-200 and 235, as compared to such WT PRM15/tE-encoding sequences and/or PRM15/tE-encoding sequences of any known wild-type and/or modified (e.g., attenuated strain) virus, including flaviviruses (and dengue viruses) shown in GenBank. The determination of those segments of wild-type flaviviral nucleotide sequences or WT flaviviral genomes that encode, e.g., a WT PRM15/truncated E protein, WT PRM15/full E protein, WT C15/full prM/full E protein is within the abilities of one of ordinary skill in the art.

In another aspect, the invention provides a nucleic acid comprising a sequence of at least about 900 nucleotides, usually at least about 1000 nucleotides, typically at least about 1100 or about 1200 nucleotides, and more preferably at least about 1300 nucleotides, that encodes an amino acid sequence that induces an immune response to a dengue virus in a subject, wherein the nucleic acid selectively hybridizes under moderately stringent conditions to at least one of SEQ ID NOS:211-218 as compared to any of SEQ ID NOS:231-234 or another known PRM15/tE-encoding dengue virus polynucleotide. Desirably, such a nucleic acid of the invention hybridizes to at least one of SEQ ID NOS:211-218 more selectively than it hybridizes to any of SEQ ID NOS:231-234 or another known PRM15/tE-encoding dengue virus polynucleotide under highly stringent conditions, preferably under regularly stringent conditions, and, most preferably, under moderately stringent conditions.

The invention also provides nucleic acids comprising a sequence that does not hybridize to one of the specifically disclosed nucleic acid sequences of the invention (e.g., to a sequence selected from the group of SEQ ID NOS:156-218, 235, 253-271, 285-330, 342, and 344), but would so hybridize but for the degeneracy of the nucleic acid code and/or the imposition of non-coding sequences (e.g., nucleotide sequences that are spliced out of a DNA sequence to form an RNA intermediate that encodes a polypeptide having an amino acid sequence substantially identical (e.g., having at least about 75%, 80%, 85%, 86%, 87%, 88% or 89%, preferably at least about 90%, 91%, 92%, 93%, or 94%, and more preferably at least about 95% (e.g., about 87-95%), 96% 97%, 98%, 99%, 99.5% sequence identity) with that of any recombinant polypeptide sequence of the invention) that do not impact on the ability of the otherwise hybridizing nucleic acid (target or test nucleic acid) to express a polypeptide that is structurally, functionally, or otherwise similar to at least one of the recombinant polypeptides of the invention. Such non-hybridizing, but related, target sequences desirably are at least about 60, desirably at least about 300, preferably at least about 900, more preferably at least about 1200, and more preferably at least about 1300 nucleotides in length.

In one aspect, the invention provides a nucleic acid that encodes a polypeptide comprising an immunogenic amino acid sequence of at least about 400 amino acid residues that has substantial identity (e.g., at least about 75%, 80%, 85%, 86%, 87%, 88% or 89%, preferably at least about 90%, 91%, 92%, 93%, or 94%, and more preferably at least about 95% (e.g., about 87-95%), 96% 97%, 98%, 99%, 99.5% sequence identity) with at least one of SEQ ID NOS:1-49 and 153-155. In another aspect, the invention includes a the nucleic acid that encodes a polypeptide comprising an immunogenic amino acid sequence of at least about 450 amino acid residues that exhibits substantial identity (e.g., at least about 75%, 80%, 85%, 86%, 87%, 88% or 89%, preferably at least about 90%, 91%, 92%, 93%, or 94%, and more preferably at least about 95% (e.g., about 87-95%), 96% 97%, 98%, 99%, 99.5% sequence identity) to at least one sequence selected from the group of SEQ ID NOS:65-116. More particularly, the nucleic acid encodes a polypeptide comprising an immunogenic amino acid sequence of at least about 400 amino acid residues that has substantial identity (e.g., at least about 75%, 80%, 85%, 86%, 87%, 88% or 89%, preferably at least about 90%, 91%, 92%, 93%, or 94%, and more preferably at least about 95% (e.g., about 87-95%), 96% 97%, 98%, 99%, 99.5% sequence identity) with at least one sequence selected from any of SEQ ID NOS:66, 67, 69, 89, 93, and 108-110.

In another aspect, the invention provides a nucleic acid that encodes a polypeptide that comprises an immunogenic amino acid sequence of at least about 650 amino acid residues that exhibits substantial identity (e.g., at least about 75%, 80%, 85%, 86%, 87%, 88% or 89%, preferably at least about 90%, 91%, 92%, 93%, or 94%, and more preferably at least about 95% (e.g., about 87-95%), 96% 97%, 98%, 99%, 99.5% sequence identity) with at least one of SEQ ID NOS:139-148, 236-253, 343, and 345. More particularly, the nucleic acid encodes a polypeptide comprising an immunogenic amino acid sequence of at least about 650 amino acid residues that has substantial identity (e.g., at least about 75%, 80%, 85%, 86%, 87%, 88% or 89%, preferably at least about 90%, 91%, 92%, 93%, or 94%, and more preferably at least about 95% (e.g., about 87-95%), 96% 97%, 98%, 99%, 99.5% sequence identity) with any of SEQ ID NOS:140-141. Preferably, the nucleic acid encodes a polypeptide comprising SEQ ID NO:141 or 140.

The nucleic acids of the invention desirably encode a polypeptide that induces an immune response to at least one dengue virus of at least one serotype in a subject. More preferably, the nucleic acid comprises a sequence that encodes at least one polypeptide comprising an immunogenic amino acid sequence that induces an immune response to at least one dengue virus of each of four virus serotypes in a subject. Desirably, the amino acid encoded by such a nucleic acid sequence induces an immune response against at least one dengue virus of at least one serotype in a subject that is at least equal to or greater than the immune response induced against at least one dengue virus of at least one serotype in the subject by a corresponding wild-type dengue virus polypeptide of a particular serotype. Desirably, the nucleic acid encodes a polypeptide that exhibits a humoral and cellular immune response to a dengue virus, preferably to one or more dengue viruses of multiple (optimally all four) virus serotypes in a subject (e.g., a recombinant C15/full prM/full E polypeptide, such as, e.g., a polypeptide comprising the sequence of SEQ ID NO:141), where at least the humoral, the cellular, or preferably both immune responses promoted/induced by the encoded polypeptide are about equal to or greater than the corresponding humoral response, cellular response, and/or both such immune responses, respectively, that are induced/promoted by a corresponding wild-type dengue virus polypeptide (e.g., wild-type dengue virus C15/full prM/full E polypeptide of one or more serotypes).

In one aspect, the recombinant nucleic acid comprises a polynucleotide sequence that encodes at least one polypeptide that induces the production of one or more neutralizing antibodies in a subject against at least one dengue virus of at least one serotype. In a particular aspect, the recombinant nucleic acid encodes a polypeptide that induces the production of neutralizing antibodies against at least one dengue virus of each of at least two, at least three, or preferably at least four serotypes in a subject. Preferably, the nucleic acid encodes a polypeptide that induces a titer of neutralizing antibodies against at least one dengue virus of at least one serotype that is at least equal to or greater than the titer of neutralizing antibodies induced against the at least one dengue virus of the at least one serotype by a wild-type nucleic acid encoding a wild-type dengue virus antigen of the same or similar size and of the same format. For example, in one aspect, a PRM15/tE polypeptide-encoding nucleic acid (e.g., SEQ ID NOS:156-200 and 235) is provided that encodes a polypeptide that induces a titer of neutralizing antibodies against at least one dengue virus of at least one serotype that is at least equal to or greater than the titer of neutralizing antibodies induced against the at least one dengue virus of the at least one serotype by a wild-type PRM15/tE polypeptide-encoding nucleic acid (e.g., any of SEQ ID NOS:149-152).

Preferably, a recombinant nucleic acid of the invention comprises a polynucleotide sequence that encodes an immunogenic polypeptide (e.g., a nucleic acid encoding a recombinant tE, full E, PRM15/tE, C15/full prM/full E, PRM15/full E, or prM/full E polypeptide) that induces the production of one or more neutralizing antibodies to at least one dengue virus of each of the four serotypes in a subject. The neutralizing antibody response induced by such encoded immunogenic polypeptide typically does not induce ADE in a mammal upon infection of the mammal with a dengue virus and/or upon secondary infection of the mammal with a dengue virus of a different serotype than the serotype of the virus the mammal was infected with before receiving the recombinant nucleic acid.

In another aspect, a recombinant nucleic acid of the invention comprises a polynucleotide sequence that encodes a polypeptide that induces a protective immune response to at least one dengue virus of at least one serotype in a subject. In another aspect, the nucleic acid comprises a polynucleotide sequence that encodes a polypeptide that induces a protective immune response against at least one dengue virus of each of at least two, preferably at least three, and more preferably against all four virus serotypes when the polypeptide is expressed in a subject.

The invention further provides a nucleic acid comprising a fragment of the one of the nucleic acids of the invention that encodes a polypeptide that induces an immune response against at least one dengue virus in a subject, which fragment is unique as compared to WT dengue virus antigens of similar size and/or format. Preferably, the nucleic acid fragment encodes a polypeptide that induces an immune response to at least one dengue virus of all four serotypes, and, more preferably, induces production of a neutralizing antibody and/or protective immune response against at least one dengue virus of all four serotypes, in a subject (most desirably without the occurrence of ADE).

The nucleic acid of the invention can comprise any suitable number of other sequences in addition to the above-described recombinant dengue antigen-encoding sequences and also or alternatively can comprise any suitable number of recombinant dengue antigen-encoding sequences. For example, a nucleic acid can comprise two or more copies of a dengue antigen-encoding sequence and/or nucleotide sequences encoding multiple different dengue antigen-encoding sequences. For example, a nucleic acid can comprise one or more of the following: (1) a nucleotide sequence encoding a recombinant C15/full prM/full E polypeptide; (2) a nucleotide sequence encoding a recombinant full prM/full E polypeptide; (3) a nucleotide sequence encoding a recombinant PRM15/tE polypeptide; (4) a nucleotide sequence encoding a WT or recombinant signal peptide, C15 or PRM15; (5) a nucleotide sequence encoding a recombinant tE polypeptide; and (6) a nucleotide sequence encoding a recombinant full E polypeptide.

In one particular aspect, the nucleic acid comprises a second sequence encoding an adjuvant and/or a cytokine, a costimulatory molecule (e.g., a mammalian B7-1 or B7-2 or an amino acid sequence that has at least substantial identity thereto or comprises a variant thereof), or a heterologous antigen (e.g., a yellow fever antigen, a malaria vaccine, etc.). The nucleic acid can comprise any suitable number and copy of such sequences, in any suitable combination, along with the recombinant dengue antigen-encoding sequence(s). The sequences can be part of a single expression cassette, but more typically and preferably are contained in separate expression cassettes (examples of which are discussed further below). In some aspects, the recombinant dengue antigen-encoding sequence and the secondary nucleic acid sequence (e.g., the cytokine-encoding sequence) are operably linked to separate and different expression control sequences, such that they are expressed at different times and/or in response to different conditions (e.g., in response to different inducers).

In general, any of the nucleic acids of the invention can be modified to increase expression in a particular host, using the techniques exemplified herein with respect to the above-described dengue virus prM/E fusion protein-encoding sequences. Codons that are utilized most often in a particular host are called optimal codons, and those not utilized very often are classified as rare or low-usage codons (see, e.g., Zhang, S. P. et al. (1991) Gene 105:61-72). Codons can be substituted to reflect the preferred codon usage of the host, a process called “codon optimization” or “controlling for species codon bias.” Optimized coding sequence comprising codons preferred by a particular prokaryotic or eukaryotic host can be used to increase the rate of translation or to produce recombinant RNA transcripts having desirable properties, such as a longer half-life, as compared with transcripts produced from a non-optimized sequence. Techniques for producing codon optimized sequences are known (see, e.g., E. et al. (1989) Nuc Acids Res 17:477-508). Translation stop codons can also be modified to reflect host preference. For example, preferred stop codons for S. cerevisiae and mammals are UAA and UGA respectively. The preferred stop codon for monocotyledonous plants is UGA, whereas insects and E. coli prefer to use UAA as the stop codon (see, e.g., Dalphin, M. E. et al. (1996) Nuc Acids Res 24:216-218). The arrangement of codons in context to other codons also can influence biological properties of a nucleic acid sequences, and modifications of nucleic acids to provide a codon context arrangement common for a particular host also is contemplated by the inventors. Thus, a nucleic acid sequence of the invention can comprise a codon optimized nucleotide sequence, i.e., codon frequency optimized and/or codon pair (i.e., codon context) optimized for a particular species (e.g., the polypeptide can be expressed from a polynucleotide sequence optimized for expression in humans by replacement of “rare” human codons based on codon frequency, or codon context, such as by using techniques such as those described in Buckingham et al. (1994) Biochimie 76(5):351-54 and U.S. Pat. Nos. 5,082,767, 5,786,464, and 6,114,148). An exemplary technique for producing codon optimized sequences is provided in Example 1.

In addition to the above-described codon optimized nucleic acid sequences (e.g., recombinant tE-encoding polynucleotide sequence, full E-encoding polynucleotide sequence, PRM15/tE-encoding polynucleotide sequence, C15/full length prM/full length E-encoding polynucleotide sequence, etc.), the nucleic acids of the invention generally express polypeptides at expression levels higher than does a corresponding wild-type polynucleotide sequence encoding a wild-type dengue virus polypeptide sequences (e.g., WT dengue virus tE-encoding polynucleotide sequence, WT dengue virus full E-encoding polynucleotide sequence, WT dengue PRM15/tE-encoding polynucleotide sequence, WT C15/full length prM/full length E-encoding polynucleotide sequence, etc.). Thus, for example, the invention provides nucleic acids encoding one or more recombinant PRM15/tE polypeptides of the invention, wherein at least one such recombinant polypeptide is expressed more efficiently than a nucleic acid comprising at least a portion of any one of SEQ ID NOS:231-234, of substantially the same length, when expressed from a substantially identical expression cassette in a subject host, such as a mammalian host.

Remarkably, some recombinant C15/full prM/full E polypeptides of the invention, expressed from the C15/full prM/full E polypeptide-encoding polynucleotides described herein, also exhibit higher levels of secretion than codon optimized C15/full prM/full E polypeptide-encoding sequences (e.g., any of SEQ ID NOS:215-218).

The nucleic acid is typically a DNA, and usually a double stranded DNA sequence. However, the invention also provides single stranded DNA, single stranded RNA, double stranded RNA, and hybrid DNA/RNA nucleic acids comprising the nucleic acid sequences of the invention also are provided. In one aspect, the invention includes a RNA sequence comprising any DNA nucleotide sequence of the invention described herein and throughout in which the thymine nucleotides in the sequence are replaced with uracil nucleotides. The invention also provides, for example, an RNA nucleic acid comprising a sequence having substantially identity (e.g., having at least about 75%, 80%, 85%, 90%, 95% or more nucleic sequence identity) with at least one sequence selected from the group of SEQ ID NOS:156-218, 235, 254-271, 285-330, 342, and 344). Also provided is a RNA nucleic acid comprising a DNA sequence selected from any of this group of sequences in which all of the thymine nucleotides in the DNA sequence are replaced with uracil nucleotides, and RNA polynucleotide sequences complementary to all such RNA nucleic acids.

The invention further provides a RNA nucleic acid that exhibits substantial identity with a sequence having substantial identity (e.g., at least about 75%, 80%, 85%, 86%, 87%, 88% or 89%, preferably at least about 90%, 91%, 92%, 93%, or 94%, and more preferably at least about 95% (e.g., about 87-95%), 96% 97%, 98%, 99%, 99.5% sequence identity) with at least one selected from the group of SEQ ID NOS:285-330. More particularly, the invention provides a RNA nucleic acid comprising a DNA sequence selected from any of SEQ ID NOS:285-330 in which each thymine residue in the DNA sequence is replaced with a uracil residue. Such RNA nucleic acids typically are at least about 1000 nucleotides, typically about 1200 nucleotides, and typically about 2000 nucleotides in length. The invention also provides at least one fragment of such an RNA nucleic acid that encodes an immunogenic amino acid sequence of the invention, and RNA polynucleotide sequences that are complementary to such fragments. Also included is an RNA polynucleotide sequence that hybridizes to such an RNA nucleic acid (e.g., comprising a DNA sequence of any of SEQ ID NOS:285-330 in which each thymine nucleotides in the sequence is replaced with uracil nucleotide) under at least moderately stringent, preferably at least regularly stringent, and more preferably at least highly stringent hybridization conditions.

In one aspect of the invention, the invention provides a DNA nucleic acid that comprises at least one expression control sequence associated with and/or typically operably linked to a recombinant nucleic acid sequence of the invention (e.g., the recombinant antigen-encoding sequence). An “expression control sequence” is any nucleic acid sequence that promotes, enhances, or controls expression (typically and preferably transcription) of another nucleic acid sequence. Suitable expression control sequences include constitutive promoters, inducible promoters, repressible promoters, and enhancers.

Promoters exert a particularly important impact on the level of recombinant polypeptide expression. The nucleic acid of the invention (e.g., recombinant DNA nucleic acid) can comprise any suitable promoter. Examples of suitable promoters include the cytomegalovirus (CMV) promoter, the HIV long terminal repeat promoter, the phosphoglycerate kinase (PGK) promoter, Rous sarcoma virus (RSV) promoters, such as RSV long terminal repeat (LTR) promoters, mouse mammary tumor virus (MMTV) promoters, HSV promoters, such as the Lap2 promoter or the herpes thymidine kinase promoter (as described in, e.g., Wagner et al. (1981) Proc Natl Acad Sci 78:144-145), promoters derived from SV40 or Epstein Barr virus, adeno-associated viral (AAV) promoters, such as the p5 promoter, metallothionein promoters (e.g., the sheep metallothionein promoter or the mouse metallothionein promoter (see, e.g., Palmiter et al. (1983) Science 222:809-814), the human ubiquitin C promoter, E. coli promoters, such as the lac and trp promoters, phage lambda P_(L) promoter, and other promoters known to control expression of genes in prokaryotic or eukaryotic cells (either directly in the cell or in viruses which infect the cell). Promoters that exhibit strong constitutive baseline expression in mammals, particularly humans, such as cytomegalovirus (CMV) promoters, such as the CMV immediate-early promoter (described in, for example, U.S. Pat. No. 5,168,062), and promoters having substantial sequence identity with such a promoter, are particularly preferred. Also preferred are recombinant promoters having novel or enhanced properties, such as those described in PCT Application Int'l Publ. No. WO 02/00897.

The promoter can have any suitable mechanism of action. Thus, the promoter can be, for example, an “inducible” promoter, (e.g., a growth hormone promoter, metallothionein promoter, heat shock protein promoter, E1B promoter, hypoxia induced promoter, radiation inducible promoter, or adenoviral MLP promoter and tripartite leader), an inducible-repressible promoter, a developmental stage-related promoter (e.g., a globin gene promoter), cell-specific, or tissue specific promoter (e.g., a smooth muscle cell α-actin promoter, myosin light-chain 1A promoter, or vascular endothelial cadherin promoter). Suitable inducible promoters include ecdysone and ecdysone-analog-inducible promoters (ecdysone-analog-inducible promoters are commercially available through Stratagene (LaJolla Calif.)). Other suitable commercially available inducible promoter systems include the inducible Tet-Off or Tet-on systems (Clontech, Palo Alto, Calif.). The inducible promoter can be any promoter that is up- and/or down-regulated in response to an appropriate signal. Additional inducible promoters include arabinose-inducible promoters, a steroid-inducible promoters (e.g., a glucocorticoid-inducible promoters), as well as pH, stress, and heat-inducible promoters.

The promoter can be, and often is, a host-native promoter, or a promoter derived from a virus that infects a particular host (e.g., a human beta actin promoter, human EF1α promoter, or a promoter derived from a human AAV operably linked to the nucleic acid can be preferred), particularly where strict avoidance of gene expression silencing due to host immunological reactions to sequences that are not regularly present in the host is of concern. The polynucleotide also or alternatively can include a bi-directional promoter system (as described in, e.g., U.S. Pat. No. 5,017,478) linked to multiple genes of interest (e.g., multiple fusion protein encoding genes).

The nucleic acid also can be operably linked to a modified or chimeric promoter sequence. The promoter sequence is “chimeric” in that it comprises at least two nucleic acid sequence portions obtained from, derived from, or based upon at least two different sources (e.g., two different regions of an organism's genome, two different organisms, or an organism combined with a synthetic sequence). Suitable promoters also include recombinant, mutated, or recursively recombined (e.g., shuffled) promoters. Minimal promoter elements, consisting essentially of a particular TATA-associated sequence, can, for example, be used alone or in combination with additional promoter elements. TATA-less promoters also can be suitable in some contexts. The promoter and/or other expression control sequences can include one or more regulatory elements have been deleted, modified, or inactivated. Preferred promoters include the promoters described in Int'l Patent Application WO 02/00897, one or more of which can be incorporated into and/or used with nucleic acids and vectors of the invention. Other shuffled and/or recombinant promoters also can be usefully incorporated into and used in the nucleic acids and vectors of the invention, e.g., to facilitate polypeptide expression.

Other suitable promoters and principles related to the selection, use, and construction of suitable promoters are provided in, e.g., Werner (1999) Mamm Genome 10(2):168-75, Walther et al. (1996) J Mol Med 74(7):379-92, Novina (1996) Trends Genet. 12(9):351-55, Hart (1996) Semin Oncol 23(1):154-58, Gralla (1996) Curr Opin Genet Dev 6(5):526-30, Fassler et al. (1996) Methods Enzymol 273:3-29, Ayoubi et al (1996), 10(4) FASEB J 10(4):453-60, Goldsteine et al. (1995) Biotechnol Annu Rev 1:105-28, Azizkhan et al (1993) Crit. Rev Eukaryot Gene Expr 3(4):229-54, Dynan (1989) Cell 58(1):1-4, Levine (1989) Cell 59(3):405-8, and Berk et al (1986) Annu Rev Genet. 20:45-79, as well as U.S. Pat. No. 6,194,191. Other suitable promoters can be identified by use of the Eukaryotic Promoter Database (release 68) (presently available at “http” colon, slash, slash, “www” dot “epd” dot “isb-sib” dot “ch” slash) and other, similar, databases, such as the Transcription Regulatory Regions Database (TRRD) (version 4.1) (available at “http” colon, slash, slash, “www” dot “bionet” dot “nsc” sot “ru” slash “trrd” slash) and the transcription factor database (TRANSFAC) (available at “http” colon, slash, slash, “transfac” dot “gbf” dot “de” slash “TRANSFAC” slash “index” dot “html”).

As an alternative to a promoter, particularly in RNA vectors and constructs, the nucleic acid sequence and/or vector can comprise one or more internal ribosome entry sites (IRESs), IRES-encoding sequences, or RNA sequence enhancers (Kozak consensus sequence analogs), such as the tobacco mosaic virus omega prime sequence.

The invention also provides a polynucleotide (or vector) that also or alternatively comprises an upstream activator sequence (UAS), such as a Gal4 activator sequence (as described in, e.g., U.S. Pat. No. 6,133,028) or other suitable upstream regulatory sequence (as described in, e.g., U.S. Pat. No. 6,204,060).

In addition to an immunogenic polynucleotide sequence, a polynucleotide or vector of the invention can include any other expression control sequences (e.g., enhancers, translation termination sequences, initiation sequences, splicing control sequences, etc.). The polynucleotide may include a Kozak consensus sequence that is functional in a mammalian cell, which can be a naturally occurring or modified sequence such as the modified Kozak consensus sequences described in U.S. Pat. No. 6,107,477. The nucleic acid can include specific initiation signals that aid in efficient translation of a coding sequence and/or fragments contained in the expression vector. These signals can include, e.g., the ATG initiation codon and adjacent sequences. In cases where a coding sequence, its initiation codon and upstream sequences are inserted into the appropriate expression vector, no additional translational control signals may be needed. However, in cases where only coding sequence (e.g., a mature protein coding sequence), or a portion thereof, is inserted, exogenous nucleic acid transcriptional control signals including the ATG initiation codon must be provided. Furthermore, the initiation codon must be in the correct reading frame to ensure transcription of the entire insert. Exogenous transcriptional elements and initiation codons can be of various origins, both natural and synthetic. The efficiency of expression can enhanced by the inclusion of enhancers appropriate to the cell system in use (see, e.g., Scharf D. et al. (1994) Results Probl Cell Differ 20:125-62; and Bittner et al. (1987) Methods in Enzymol 153:516-544 for discussion). Suitable enhancers include, for example, the rous sarcoma virus (RSV) enhancer and the RTE enhancers described in U.S. Pat. No. 6,225,082. Initiation signals including the ATG initiation codon and adjacent sequences are desirably incorporated in the polynucleotide. In cases where a polynucleotide sequence, its initiation codon and upstream sequences are inserted into the appropriate expression vector, no additional translational control signals may be needed. However, in cases where only coding sequence (e.g., a mature protein coding sequence), or a portion thereof, is inserted, exogenous nucleic acid transcriptional control signals including the ATG initiation codon are to be provided. The initiation codon must be in the correct reading frame to ensure transcription of the entire insert. Exogenous transcriptional elements and initiation codons can be of various origins, both natural and synthetic. The efficiency of expression can enhanced by the inclusion of enhancers appropriate to the cell system in use (see, e.g., Scharf D. et al. (1994) Results Probl Cell Differ 20:125-62; and Bittner et al. (1987) Meth in Enzymol 153:516-544).

A nucleic acid of the invention (e.g., DNA) may also comprise a ribosome binding site for translation initiation and a transcription-terminating region. A suitable transcription-terminating region is, for example, a polyadenylation sequence that facilitates cleavage and polyadenylation of the RNA transcript produced from the DNA nucleic acid. Any suitable polyadenylation sequence can be used, including a synthetic optimized sequence, as well as the polyadenylation sequence of BGH (Bovine Growth Hormone), human growth hormone gene, polyoma virus, TK (Thymidine Kinase), EBV (Epstein Barr Virus), rabbit beta globin, and the papillomaviruses, including human papillomaviruses and BPV (Bovine Papilloma Virus). Preferred polyadenylation (polyA) sequences include the SV40 (human Sarcoma Virus-40) polyadenylation sequence and the BGH polyA sequence, which is particularly preferred. Such polyA sequences are described in, e.g., Goodwin et al. (1998) Nucleic Acids Res 26(12):2891-8, Schek et al. (1992) Mol Cell Biol 12(12):5386-93, and van den Hoff et al. (1993) Nucleic Acids Res 21(21):4987-8. Additional principles related to selection of appropriate polyadenylation sequences are described in, e.g., Levitt et al. (1989) Genes Dev 1989 3(7):1019-1025, Jacob et al. (1990) Crit Rev Eukaryot Gene Expr 1(1):49-59, Chen et al. (1995) Nucleic Acids Res 23(14):2614-2620, Moreira et al. (1995) EMBO J 14(15):3809-3819, Carswell et al. (1989) Mol Cell Biol 1989 9(10):4248-4258.

The polynucleotide can further comprise site-specific recombination sites, which can be used to modulate transcription of the polynucleotide, as described in, e.g., U.S. Pat. Nos. 4,959,317, 5,801,030 and 6,063,627, European Patent Application 0 987 326 and International Patent Application WO 97/09439.

The invention further provides a nucleic acid of the invention that further comprises one or more immunostimulatory oligonucleotide sequences, e.g., a sequence according to the sequence pattern N₁CGN₂)_(x), wherein N₁ is, 5′ to 3′, any two purines, any purine and a guanine, or any three nucleotides; N₂ is, 5′ to 3′, any two purines, any guanine and any purine, or any three nucleotides; and x is any number greater than 0 or 1. Immunomodulatory sequences are known in the art, and described in, e.g., Wagner et al. (2000) Springer Semin Immunopathol 22(1-2):147-52, Van Uden et al. (2000) Springer Semin Immunopathol 22(1-2):1-9, and Pisetsky (1999) Immunol Res 19(1):35-46, as well as U.S. Pat. Nos. 6,207,646, 6,194,388, 6,008,200, 6,239,116, and 6,218,371. The immunostimulatory oligonucleotide sequence(s) may be unmethylated. In another aspect, the invention provides a nucleic acid that comprises a polynucleotide sequence that encodes one or more recombinant polypeptides of the invention and further comprises at least one polynucleotide sequence that encodes at least one immunostimulatory sequence as described herein. Alternatively, the immunostimulatory oligonucleotide sequence is expressed from a second polynucleotide sequence that is separate from (e.g., on a separate or second vector) the first polynucleotide sequence encoding the recombinant polypeptide of the invention

In another aspect, the invention provides a nucleic acid that comprises a polynucleotide sequence that encodes one or more recombinant polypeptides of the invention and further comprises at least one polynucleotide sequence that encodes at least one protein adjuvant. Alternatively, the protein adjuvant is expressed from a second polynucleotide sequence that is separate from (e.g., on a separate or second vector) the first polynucleotide sequence encoding the recombinant polypeptide of the invention. Preferably, the adjuvant is a cytokine that promotes the immune response induced by at least immunogenic recombinant polypeptide of the invention. Preferably, the cytokine is a granulocyte macrophage colony stimulating factor (a GM-CSF, e.g., a human GM-CSF) an interferon (e.g., human interferon (IFN) alpha, IFN-beta, IFN-gamma), or a peptide comprising an amino acid sequence that is at least substantially identical (e.g., having at least about 75%, 80%, 85%, 86%, 87%, 88% or 89%, preferably at least about 90%, 91%, 92%, 93%, or 94%, and more preferably at least about 95% (e.g., about 87-95%), 96% 97%, 98%, 99%, 99.5% or more sequence identity) to the sequence of at least one such cytokine. Genes encoding such factors are known. Human GM-CSF sequences are described in, e.g., Wong et al. (1985) Science 228:810, Cantrell et al. (1985) Proc Natl Acad Sci 82:6250, and Kawasaki et al. (1985) Science 230:291. Desirably, in one embodiment, such a nucleic acid expresses an amount of GM-CSF or a functional analog thereof that detectably stimulates the mobilization and differentiation of dendritic cells (DCs) and/or T-cells, increases antigen presentation, and/or increases monocytes activity, such that the immune response induced by the immunogenic amino acid sequence is increased. Suitable interferon genes, such as IFN-gamma genes also are known (see, e.g., Taya et al. (1982), Embo J 1:953-958, Cerretti et al. (1986) J Immunol 136(12): 4561, and Wang et al. (1992) Sci China B 35(1):84-91). Desirably, the IFN, such as the IFN-gamma, is expressed from the nucleic acid in an amount that increases the immune response of the immunogenic amino acid sequence (e.g., by enhancing a T cell immune response induced by the immunogenic amino acid sequence). Advantageous IFN-homologs and IFN-related molecules that can be co-expressed or co-administered with the polynucleotide and/or polypeptide of the invention are described in, e.g., International Patent Applications WO 01/25438 and WO 01/36001. Co-administration (which herein includes both simultaneous and serial administration) of about 5 to about 10 μg of a GM-CSF-encoding plasmid with from about 5 to about 50 μg of a plasmid encoding one of the polypeptides of the invention is expected to be effective for enhancing the antibody response in a mouse model.

In another aspect, a nucleic acid of the invention comprises a T7 RNA polymerase promoter operably linked to the nucleic acid sequence, facilitating the synthesis of single stranded RNAs from the nucleic acid sequence. T7 and T7-derived sequences are known as are exemplary expression systems using T7 (see, e.g., Tabor and Richardson (1986) Proc Natl Acad Sci USA 82: 1074, Studier and Moffat (1986) J Mol Biol 189:113, and Davanloo et al. (1964) Proc Natl Acad Sci USA 81:2035). In one aspect, for example, nucleic acids comprising a T7 RNA polymerase and a polynucleotide sequence encoding at least one recombinant polypeptide of the invention are provide. Furthermore, a nucleic acid of the invention can comprise an origin of replication useful for propagation in a microorganism. The bacterial origin of replication (Ori) utilized is preferably one that does not adversely affect gene expression in mammalian cells. Examples of useful origin of replication sequences include the f1 phage ori, RK2 oriV, pUC ori, and the pSC101 ori. Preferred original of replication sequences include the ColEI ori and the p15 (available from plasmid pACYC177, New England Biolab, Inc.), alternatively another low copy ori sequence (similar to p15) can be desirable in some contexts. The nucleic acid in this respect desirably acts as a shuttle vector, able to replicate and/or be expressed (desirably both—such vectors capable of expression can be referred to as “expression vectors”) in both eukaryotic and prokaryotic hosts (e.g., a vector comprising an origin of replication sequences recognized in both eukaryotes and prokaryotes).

A polynucleotide of the invention preferably is positioned in and/or administered in the form of a suitable delivery vehicle (i.e., a vector). The vector can be any suitable vector, including chromosomal, non-chromosomal, and synthetic nucleic acid vectors (a nucleic acid sequence comprising the above described expression cassette elements (expression control and other nucleic acid associated sequences)). Examples of such vectors include derivatives of SV40, bacterial plasmids, phage DNA, baculovirus, yeast plasmids, vectors derived from combinations of plasmids and phage DNA, and viral nucleic acid (RNA or DNA) vectors. See, e.g., FIGS. 1 and 2.

In one aspect, the nucleic acid is be administered in a naked DNA or RNA vector, including, for example, a linear expression element (as described in, e.g., Sykes and Johnston (1997) Nat Biotech 17:355-59), a compacted nucleic acid vector (as described in, e.g., U.S. Pat. No. 6,077,835 and/or International Patent Application WO 00/70087), a plasmid vector such as pBR322, pUC19/18, or pUC118/119, a “midge” minimal-sized nucleic acid vector (as described in, e.g., Schakowski et al. (2001) Mol Ther 3:793-800) or as a precipitated nucleic acid vector construct, such as a CaPO₄ precipitated construct (as described in, e.g., International Patent Application WO 00/46147, Benvenisty and Reshef (1986) Proc Natl Acad Sci USA 83:9551-55, Wigler et al. (1978), Cell 14:725, and Coraro and Pearson (1981) Somatic Cell Genetics 7:603). Nucleotide vectors and the usage thereof are known in the art (see, e.g., U.S. Pat. Nos. 5,589,466 and 5,973,972).

The vector can be an expression vector that is suitable for expression in a bacterial system. Any vector for use in a bacterial host can be utilized. Suitable vectors include, for example, vectors which direct high level expression of fusion proteins that are readily purified (e.g., multifunctional E. coli cloning and expression vectors such as BLUESCRIPT (Stratagene), pIN vectors (Van Heeke & Schuster, J Biol Chem 264:5503-5509 (1989); pET vectors (Novagen, Madison Wis.); and the like).

The expression vector also or alternatively can be a vector suitable for expression in a yeast system. Any vector suitable for expression in a yeast system can be employed. Suitable vectors for use in, e.g., Saccharomyces cerevisiae include, for example, vectors comprising constitutive or inducible promoters such as alpha factor, alcohol oxidase and PGH (reviewed in: Ausubel, supra, Berger, supra, and Grant et al. Methods in Enzymol 153: 516-544 (1987)).

The expression vector can be propagated in a host cell. The host cell can be a eukaryotic cell, such as a mammalian cell, a yeast cell, or a plant cell, or the host cell can be a prokaryotic cell, such as a bacterial cell. Introduction of the construct into the host cell can be effected by calcium phosphate transfection, DEAE-Dextran mediated transfection, electroporation, gene or vaccine gun, injection, or other common techniques (see, e.g., Davis et al., BASIC METHODS IN MOLECULAR BIOLOGY (1986) for a description of in vivo, ex vivo, and in vitro methods).

The nucleic acids and nucleic acid vectors of the invention can further comprise non-naturally occurring nucleotides and nucleotide sequences. Modifications to recombinant nucleic acid sequences of the invention can include making at least a portion or fragment of the recombinant nucleic acid sequence (e.g., a flaviviral antigen-encoding polynucleotide sequence) comprise a phosphorothioate backbone, incorporating at least one synthetic nucleotide analog in place of or in addition to the naturally occurring nucleotides in the nucleic acid sequence, and the addition of bases other than guanine, adenine, uracil, thymine, and cytosine, or the uses of such non-normally occurring bases in such a sequence. Such modifications can be associated with longer half-life, and thus can be desirable in nucleic acids vectors of the invention. Thus, in one aspect, the invention provides recombinant nucleic acids and nucleic acid vectors comprising at least one of the aforementioned modifications, or any suitable combination thereof, wherein the nucleic acid persists longer in a mammalian host than a substantially identical nucleic acid without such a modification or modifications.

The expression vector can also comprises nucleotides encoding a secretion/localization sequence, which targets polypeptide expression to a desired cellular compartment, membrane, or organelle, or which directs polypeptide secretion to the periplasmic space or into the cell culture media. Such sequences are known in the art, and include secretion leader or signal peptides, organelle targeting sequences (e.g., nuclear localization sequences, ER retention signals, mitochondrial transit sequences, chloroplast transit sequences), membrane localization/anchor sequences (e.g., stop transfer sequences, GPI anchor sequences), and the like.

In addition, the expression vectors of the invention optionally comprise one or more selectable marker genes to provide a phenotypic trait for selection of transformed host cells, such as dihydrofolate reductase resistance, neomycin resistance, puromycin resistance, and/or blasticidin resistance for eukaryotic cell culture, or such as tetracycline or ampicillin resistance in E. coli.

Additional nucleic acids provided by the invention include cosmids. Any suitable cosmid vector can be used to replicate, transfer, and express the nucleic acid sequence of the invention. Typically, a cosmid comprises a bacterial oriV, an antibiotic selection marker, a cloning site, and either one or two cos sites derived from bacteriophage lambda. The cosmid can be a shuttle cosmid or mammalian cosmid, comprising a SV40 oriV and, desirably, suitable mammalian selection marker(s). Cosmid vectors are further described in, e.g., Hohn et al. (1988) Biotechnology 10:113-27.

The present invention also includes recombinant constructs comprising one or more of the nucleic acid sequences, including a recombinant flavivirus virus antigen-encoding polynucleotide sequence (e.g., a recombinant dengue virus antigen-encoding polynucleotide sequence), as broadly described above. The constructs comprise a vector, such as, a plasmid, a cosmid, a phage, a virus, a bacterial artificial chromosome (BAC), a yeast artificial chromosome (YAC), and the like, into which a nucleic acid sequence of the invention has been inserted, in a forward or reverse orientation.

In one aspect of the invention, delivery of a recombinant DNA sequence of the invention (e.g., a recombinant flavivirus antigen-encoding DNA sequence) is accomplished with a naked DNA plasmid or plasmid associated with one or more transfection-enhancing agents, as discussed further herein. The plasmid DNA vector can have any suitable combination of features. In some aspects, preferred plasmid DNA vectors comprise a strong promoter/enhancer region (e.g., human CMV, RSV, SV40, SL3-3, MMTV, or HIV LTR promoter), an effective poly(A) termination sequence, an origin of replication for plasmid product in E. coli, an antibiotic resistance gene as selectable marker, and a convenient cloning site (e.g., a polylinker). A particular plasmid vector for delivery of the nucleic acid of the invention in this respect is the vector pMaxVax10.1, the construction and features of which are described in Example 1 (FIGS. 1 and 2). Optionally, such a plasmid vector includes at least one immunostimulatory sequence (ISS) and/or at least one gene encoding a suitable cytokine adjuvant (e.g., a GM-CSF sequence), as further described elsewhere herein.

In other aspects, the nucleic acid of the sequence of the invention is positioned in and/or delivered to the host cell or host animal via a viral vector. Any suitable viral vector can be used in this respect, and several are known in the art. A viral vector can comprise any number of viral polynucleotides, alone or in combination with one or more viral proteins, which facilitate delivery, replication, and/or expression of the nucleic acid of the invention in a desired host cell. The viral vector can be a polynucleotide comprising all or part of a viral genome, a viral protein/nucleic acid conjugate, a virus-like particle (VLP), a vector similar to those described in U.S. Pat. No. 5,849,586 and International Patent Application WO 97/04748, or an intact virus particle comprising viral nucleic acids and the nucleic acid of the invention. A viral particle viral vector (i.e., a recombinant virus) can comprise a wild-type viral particle or a modified viral particle, particular examples of which are discussed below.

The viral vector can be a vector which requires the presence of another vector or wild-type virus for replication and/or expression (i.e., a helper-dependent virus), such as an adenoviral vector amplicon. Typically, such viral vectors consist essentially of a wild-type viral particle, or a viral particle modified in its protein and/or nucleic acid content to increase transgene capacity or aid in transfection and/or expression of the nucleic acid (examples of such vectors include the herpes virus/AAV amplicons).

Preferably, though not necessarily, the viral vector particle is derived from, is based on, comprises, or consists of, a virus that normally infects animals, preferably vertebrates, such as mammals and, especially, humans. Suitable viral vector particles in this respect, include, for example, adenoviral vector particles (including any virus of or derived from a virus of the adenoviridae), adeno-associated viral vector particles (AAV vector particles) or other parvoviruses and parvoviral vector particles, papillomaviral vector particles, flaviviral vectors, alphaviral vectors, herpes viral vectors, pox virus vectors, retroviral vectors, including lentiviral vectors. Examples of such viruses and viral vectors are in, e.g., FIELDS VIROLOGY, supra, Fields et al., eds., VIROLOGY Raven Press, Ltd., New York (3rd ed., 1996 and 4th ed., 2001), ENCYCLOPEDIA OF VIROLOGY, R. G. Webster et al., eds., Academic Press (2nd ed., 1999), FUNDAMENTAL VIROLOGY, Fields et al., eds., Lippincott-Raven (3rd ed., 1995), Levine, “Viruses,” Scientific American Library No. 37 (1992), MEDICAL VIROLOGY, D. O. White et al., eds., Acad. Press (2nd ed. 1994), INTRODUCTION TO MODERN VIROLOGY, Dimock, N. J. et al., eds., Blackwell Scientific Publications, Ltd. (1994).

Viral vectors that can be employed with polynucleotides of the invention and the methods described herein include adenovirus and adeno-associated vectors, as in, e.g., Carter (1992) Curr Opinion Biotech 3:533-539 (1992) and Muzcyzka (1992) Curr Top Microbiol Immunol 158:97-129 (1992). Additional types and aspects of AAV vectors are described in, e.g., Buschacher et al., Blood, 5(8), 2499-504, Carter, Contrib. Microbiol. 4: 85-86 (2000), Smith-Arica, Curr. Cardiol. Rep. 3(1):41-49 (2001), Taj, J. Biomed. Sci. 7(4):279-91 (2000), Vigna et al., J. Gene Med. 2(5):308-16 (2000), Klimatcheva et al., Front. Biosci. 4:D481-96 (1999), Lever et al., Biochem. Soc. Trans. 27(6):841-47 (1999), Snyder, J. Gene Med. 1(3):166-75 (1999), Gerich et al., Knee Surg. Sports Traumatol. Arthrosc. 5(2):118-23 (1998), and During, Adv. Drug Deliv. Review 27(1):83-94 (1997), and U.S. Pat. Nos. 4,797,368, 5,139,941, 5,173,414, 5,614,404, 5,658,785, 5,858,775, and 5,994,136, as well as other references discussed elsewhere herein). Adeno-associated viral vectors can be constructed and/or purified using the methods set forth, for example, in U.S. Pat. No. 4,797,368 and Laughlin et al., Gene 23:65-73 (1983).

Another type of viral vector that can be employed with polynucleotides and methods of the invention is a papillomaviral vector. Suitable papillomaviral vectors are known in the art and described in, e.g., Hewson (1999) Mol Med Today 5(1):8, Stephens (1987) Biochem J 248(1):1-11, and U.S. Pat. No. 5,719,054. Particularly preferred papillomaviral vectors are provided in, e.g., International Patent Application WO 99/21979.

Alphavirus vectors can be gene delivery vectors in other contexts. Alphavirus vectors are known in the art and described in, e.g., Carter (1992) Curr Opinion Biotech 3:533-539, Muzcyzka (1992) Curr Top Microbiol Immunol. 158:97-129, Schlesinger Expert Opin Biol Ther. 2001 March; 1(2):177-91, Polo et al. Dev Biol (Basel). 2000; 104:181-5, Wahlfors et al. Gene Ther. 2000 March; 7(6):472-80, Colombage et al. Virology. 1998 Oct. 10; 250(1):151-63, and International Patent Applications WO 01/81609, WO 00/39318, WO 01/81553, WO 95/07994, and WO 92/10578.

Another advantageous group of viral vectors are the herpes viral vectors. Examples of herpes viral vectors are described in, e.g., Lachmann et al., Curr Opin Mol Ther 1999 October; 1(5):622-32, Fraefel et al., Adv Virus Res. 2000; 55:425-51, Huard et al., Neuromuscul Disord. 1997 July; 7(5):299-313, Glorioso et al., Annu Rev Microbiol. 1995; 49:675-710, Latchman, Mol. Biotechnol. 1994 October; 2(2):179-95, and Frenkel et al., Gene Ther. 1994; 1 Suppl 1:S40-6, as well as U.S. Pat. Nos. 6,261,552 and 5,599,691.

Retroviral vectors, including lentiviral vectors, also can be advantageous gene delivery vehicles in particular contexts. There are numerous retroviral vectors known in the art. Examples of retroviral vectors are described in, e.g., Miller, Curr Top Microbiol Immunol (1992) 158:1-24; Salmons and Gunzburg (1993) Human Gene Therapy 4:129-141; Miller et al. (1994) Methods in Enzymology 217:581-599, Weber et al., Curr Opin Mol Ther. 2001 October; 3(5):439-53, Hu et al., Pharmacol Rev. 2000 December; 52(4):493-511, Kim et al., Adv Virus Res. 2000; 55:545-63, Palu et al., Rev Med Virol. 2000 May-June; 10(3):185-202, and Takeuchi et al., Adv Ex Med Biol. 2000; 465:23-35, as well as U.S. Pat. Nos. 6,326,195, 5,888,502, 5,580,766, and 5,672,510.

Adenoviral vectors also can be suitable viral vectors for gene transfer. Adenoviral vectors are well known in the art and described in, e.g., Graham et al (1995) Mol Biotechnol 33(3):207-220, Stephenson (1998) Clin Diagn Virol 10(2-3):187-94, Jacobs (1993) Clin Sci (Lond). 85(2):117-22, U.S. Pat. Nos. 5,922,576, 5,965,358 and 6,168,941 and Int'l Patent Appns WO 98/22588, WO 98/56937, WO 99/15686, WO 99/54441, and WO 00/32754. Adenoviral vectors, herpes viral vectors and Sindbis viral vectors, useful in the practice of the invention, are described in, e.g., Jolly (1994) Cancer Gene Therapy 1:51-64, Latchman (1994) Molec Biotechnol 2:179-195, and Johanning et al. (1995) Nucl Acids Res 23:1495-1501.

Other suitable viral vectors include pox viral vectors. Examples of such vectors are discussed in, e.g., Berencsi et al., J Infect Dis (2001)183(8): 1171-9; Rosenwirth et al., Vaccine 2001 Feb. 8; 19(13-14):1661-70; Kittlesen et al., J Immunol (2000) 164(8):4204-11; Brown et al. Gene Ther 2000 7(19):1680-9; Kanesa-thasan et al., Vaccine (2000) 19(4-5):483-91; Sten (2000) Drug 60(2):249-71. Vaccinia virus vectors are preferred pox virus vectors. Examples of such vectors and uses thereof are provided in, e.g., Venugopal et al. (1994) Res Vet Sci 57(2):188-193, Moss (1994) Dev Biol Stand 82:55-63 (1994), Weisz et al. (1994) Mol Cell Biol 43:137-159, Mahr and Payne (1992) Immunobiology 184(2-3):126-146, Hruby (1990) Clin Microbiol Rev 3(2):153-170, and International Patent Applications WO 92/07944, WO 98/13500, and WO 89/08716.

Another aspect of the invention is a flaviviral vector comprising at least one recombinant nucleic acid sequence of the invention (e.g., a recombinant PRM15 or C15 signal peptide, recombinant tE polypeptide-encoding nucleic acid, full E polypeptide-encoding nucleic acid, PRM15/tE polypeptide-encoding nucleic acid, C15/full prM/full E polypeptide-encoding nucleic acid, C15/full prM/tE polypeptide-encoding nucleic acid). The nucleic acid can be positioned in any suitable portion of the flaviviral genome. For example, the nucleic acid can be inserted into or used to replace a nucleotide sequence portion of the genome, typically a nucleotide sequence portion that encodes a similar or equivalent polypeptide as does the nucleic acid. For example, a recombinant C15/full prM/full E polypeptide-encoding nucleic acid can replace all or part wild-type C15/full prM/full E-encoding sequence of the flaviviral genome. Thus, a recombinant polypeptide of the invention can be positioned in a portion of the flaviviral envelope.

In this respect, one or more nucleic acids of the invention can be incorporated into any suitable flaviviral vector. Examples of suitable vectors are described in, e.g., Bonaldo et al., Mem Inst Oswaldo Cruz. 2000; 95 Suppl 1:215-23, Caufour et al. Virus Res. 2001 Nov. 5; 79(1-2):1-14, Guirakhoo et al. J Virol. 2001 August; 75(16):7290-304, Pletnev et al. Virology. 2000 Aug. 15; 274(1):26-31, Guirakhoo et al. J Virol. 2000 June; 74(12):5477-85, and International Patent Applications WO 93/06214 and WO 01/53467. Techniques for constructing recombinant viral vectors and/or modifying known or recombinant viral vectors are disclosed in Sambrook (supra) and other referenced cited herein. Replication-deficient (RD) flaviviruses (including, e.g., RD dengue and yellow fever viruses) also can be useful as vectors or for delivery vehicles. Included is a replication-deficient flavivirus (e.g., RD dengue or YF virus) comprising at least one polypeptide of the invention in place of or in addition to the native flavivirus (e.g., dengue or YF virus) envelope protein or native flavivirus (e.g., dengue or YF virus) prM protein and envelope protein. Also contemplated is a replication-deficient flavivirus comprising at least one nucleic acid of the invention in place of or in addition to a nucleic acid segment of the WT flavivirus genome that encodes the flavivirus envelope protein or the flavivirus prM protein and envelope protein.

A dengue virus that is replication-deficient in mosquito hosts and that can be combined with the polypeptide of the invention (via the nucleic acid of the invention) to serve as a vaccine is described in International Patent Application WO 00/14245. The use of one or more viruses and/or viral vectors of the Flaviviridae family of viruses or, including, e.g., but not limited to, a yellow fever virus or yellow fever virus vector (see, e.g., Guirakhoo et al., J. Virol. 75(16):7290-304 (2001)) to deliver at least one nucleic acid and/or at least one polypeptide of the invention is believed to be advantageous.

In another aspect, the invention provides a chimeric virus comprising a virus of the Flaviviridae family of viruses (such as, e.g., a yellow fever virus, such as yellow fever 17D or the like) in which the complete E protein-encoding nucleic acid sequence(s) or fragment(s) thereof (e.g., a nucleic acid sequence encoding a tE protein) of the virus of the Flaviviridae family (e.g., yellow fever virus) is substituted with a corresponding recombinant E-protein-encoding nucleic acid sequence of the invention (or a recombinant tE-polypeptide-encoding nucleic acid). Such chimeric virus may be an attenuated virus of the Flaviviridae family (e.g., an attenuated yellow fever virus). The invention also includes a chimeric virus of the Flaviviridae family of viruses (e.g., yellow fever virus) in which the E protein-encoding gene(s) of the virus of the Flaviviridae family (e.g., yellow fever virus) (or gene encoding a truncated E protein) is substituted with a recombinant E-protein-encoding nucleotide (or recombinant tE polypeptide-encoding nucleotide) of the invention. Typically, the nucleotide length of the substituted recombinant nucleotide of the invention is substantially equivalent to that of the replaced virus nucleotide sequence. The nucleic acid sequence encoding the truncated E protein typically comprises a nucleotide segment corresponding to the gene encoding the E protein minus the nucleotide residues that encode at least about 8%, 10%, or 12% of the C-terminal amino acid residues of the E protein.

The invention also provides nucleic acids that comprise the genome of a virus of the Flaviviridae family (e.g., yellow fever virus genome or dengue virus genome) in which the nucleotide sequence of the genome encoding the E protein (or a truncE protein) is replaced with a recombinant nucleotide sequence of the invention that encodes a recombinant full length E protein or recombinant truncE polypeptide. Included are substituted nucleic acids (isolated from the virus) and polypeptides encoded by all such nucleic acids.

Similarly, the invention provides a chimeric virus comprising a virus of the Flaviviridae family of viruses (e.g., a yellow fever virus, such as yellow fever 17D, or dengue virus, such as DEN-2, DEN-3) in which a PRM15/truncated E polypeptide-encoding nucleic acid sequence of the virus of the Flaviviridae family (e.g., yellow fever virus) is substituted with a recombinant PRM15/tE polypeptide-encoding nucleic acid sequence of the invention. Also provided is a chimeric virus comprising a virus of the Flaviviridae family of viruses (such as, e.g., yellow fever 17D, DEN-2) in which a C15/full length prM/full length E-polypeptide-encoding nucleic acid sequence of the virus of the Flaviviridae family (e.g., yellow fever virus) is substituted with a recombinant C15/full length prM/full length E polypeptide-encoding nucleic acid sequence of the invention. In both such embodiments, the chimeric virus may be an attenuated virus of the Flaviviridae family (e.g., an attenuated yellow fever virus). The nucleotide length of the substituted recombinant nucleotide is usually substantially equivalent to that of the replaced virus nucleotide sequence. Also included are such substituted nucleic acids (isolated from the virus) and polypeptides encoded by all such nucleic acids.

The invention includes chimeric replication-deficient or attenuated viruses comprising a virus of the Flaviviridae family of viruses with one or more synthetic or recombinant polypeptides or nucleic acids of the invention. Such chimeric viruses may become replication-deficient or attenuated by incorporation of the one or more synthetic or recombinant polypeptides or nucleic acids of the invention. Methods of making replication-deficient or attenuated viruses by substituting portions of the WT flavivirus genome with synthetic or recombinant nucleic acids as described above are included.

Also included is a chimeric virus having a genome comprising a full length chimeric flavivirus genome comprising a nucleotide sequence comprising at least one first nucleic acid of the invention, said at least one first nucleic acid encoding at least one recombinant or synthetic dengue virus structural protein of the invention, wherein said at least one first nucleic acid is linked to at least one second nucleic acid encoding at least one non-structural protein of a second flavivirus, wherein the second flavivirus is not a dengue virus, and wherein the chimeric flavivirus is defined as an approximately 11-kilobase positive strand RNA virus having a genome that codes in one open reading frame for three structural proteins, capsid (C), premembrane (prM) and envelope (E), followed by seven non-structural proteins, NS1, NS2A, NS2B, NS3, NS4A, NS4B and NS5.

Any flavivirus can be modified by the incorporation of at least one nucleic acid of the invention, preferably at the nucleic acid level, using standard molecular biology techniques. Typically, in such aspects, at least a portion of the particular native prM/E-encoding nucleic acid sequences of interest (e.g., native sequence encoding tE, full E, PRM15/tE, or C15/full prM/full E polypeptide) are removed and replaced/substituted with a recombinant nucleic acid of the invention, such that a recombinant polypeptide is encoded and the flavivirus comprises a recombinant polypeptide of the invention in place of or in addition to its native prM/E-encoding sequence interest (e.g., native sequence encoding tE, full E, PRM15/tE, or C15/full prM/full E polypeptide). In one aspect, this technique is performed with non-attenuated flaviviruses by, e.g., replacing at least one nucleotide segment or portion of the native flavivirus genome (e.g., native sequence encoding tE, full E, PRM15/tE, or C15/full prM/full E polypeptide) with at least one recombinant polypeptide of the invention. Such incorporation may produce a chimeric virus that is attenuated. In another aspect, this technique is performed with flaviviruses that are already inactivated or that already comprise a proven attenuated virus genome. In some particular aspects, the attenuated flavivirus is an attenuated dengue virus, e.g., PDK 53, examples of which are described in, e.g., Bhhamarapravati et al., Vaccine 18:44-47 (2000), Men et al., J. Virol. 70(6):3930-37 (1996), Kanesa-thasan, Kanesa-thasan et al., Vaccine 19:3179-88 (20001) and International Patent Applications WO 00/57910, WO 00/57909, WO 00/57908, and WO 00/57904. Low pathogenicity and/or low side-effect attenuated dengue viruses derived from dengue-2 virus strains may be among those used. In one aspect, a dose of about 50 or about 1×10² plaque forming units (pfu) or focus forming units (ffu) to about 6×10¹⁰ pfu or to about 7×10¹⁰ pfu (e.g., about 3.5×10¹⁰ pfu to about 4.5×10¹⁰ pfu) of such a recombinant (e.g., attenuated) flaviviral vector or virus provides an amount effective to induce an immune response in a suitable subject (e.g., an animal, such as a mammal, including a human) to at least one flavivirus (e.g., dengue virus of at least one serotype). Such a dose is administered to the subject by any route described herein (e.g., subcutaneous injection). In another aspect, a dose of about 1×10² to about 5×10⁴ pfu, about 1×10² to about 1.5×10⁴ pfu, about 1×10² to about 1×10³ pfu, or about 1×10³ to about 1×10⁶ pfu or about 1×10⁸ pfu of a recombinant (e.g., attenuated) flaviviral vector or virus (e.g., a recombinant attenuated dengue virus) is effective to induce an immune response to at least one flavivirus (e.g., at least one dengue virus of at least one serotype upon administration to the subject. Alternatively, the minimum lethal dosage (MLD₅₀) equivalent to any above described pfu dosage can be administered to the subject.

The toxicity and therapeutic efficacy of vectors or viruses that include recombinant molecules of the invention are determined using standard pharmaceutical procedures in cell cultures or experimental animals. One can determine the MLD₅₀ (the minimum dose lethal to 50% of the population) and/or the ED₅₀ (the dose therapeutically effective in 50% of the population) using procedures presented herein and those otherwise known in the art. See also S. Plotkin and W. Orenstein, VACCINES (W.B. Saunders Co. 1999 3d ed.) for suggested doses for known flavivirus vaccines, including yellow fever virus 17D vaccine. Nucleic acids, polypeptides, proteins, fusion proteins, transduced cells and other formulations of the present invention can be administered in an amount determined, e.g., by the MLD₅₀ of the formulation, and the side-effects thereof at various concentrations, as applied to the mass and overall health of the patient. Thus, for example, the invention provides a method of inducing an immune response by administering a dose equal or greater to the ED₅₀ of a pharmaceutically acceptable composition comprising a population of recombinant yellow fever virus particles (e.g., 17D vaccine variants) that comprise a recombinant polypeptide or nucleic acid of the invention. Administration can be accomplished via single dose or divided doses (either by co-administration, serial administration, or combinations thereof). Administration techniques and protocols are described in, e.g., Plotkin (VACCINES) supra and other references cited herein. In a related sense, techniques for assessing dosage of the nucleic acid, polypeptide, vector, and cell compositions effective for inducing immunity are described in, e.g., European Patent Application 1 156 333 and references cited therein.

In some aspects, it is preferred that the virus vector is attenuated or replication-deficient in a host cell or host. AAV vectors, which are naturally replication-deficient in the absence of complementing adenoviruses or at least adenovirus gene products (provided by, e.g., a helper virus, plasmid, or complementation cell), are preferred in this respect. By “replication-deficient” is meant that the viral vector comprises a genome that lacks at least one replication-essential gene function. A deficiency in a gene, gene function, or gene or genomic region, as used herein, is defined as a deletion of sufficient genetic material of the viral genome to impair or obliterate the function of the gene whose nucleic acid sequence was deleted in whole or in part. Replication-essential gene functions are those gene functions that are required for replication (i.e., propagation) of a replication-deficient viral vector. The essential gene functions of the viral vector particle vary with the type of viral vector particle at issue. Examples of replication-deficient viral vector particles are described in, e.g., Marconi et al., Proc. Natl. Acad. Sci. USA, 93(21), 11319-20 (1996), Johnson and Friedmann, Methods Cell Biol., 43 (pt. A), 211-30 (1994), Timiryasova et al., J. Gene Med., 3(5), 468-77 (2001), Burton et al., Stem Cells, 19(5), 358-77 (2001), Kim et al., Virology, 282(1), 154-67 (2001), Jones et al., Virology, 278(1), 137-50 (2000), Gill et al., J. Med. Virol., 62(2), 127-39 (2000), Chen and Engleman, J. Virol., 74(17), 8188-93 (2000), Marconi et al., Gene Ther., 6(5), 904-12 (1999), Krisky et al., Gene Ther., 5(11), 1517-30 (1998), Bieniasz et al., Virology, 235(1), 65-72 (1997), Strayer et al., Biotechniques, 22(3), 447-50 (1997), Wyatt et al., Vaccine, 14(15), 1451-8 (1996), and Penciolelli et al., J. Virol., 61(2), 579-83 (1987). Other replication-deficient vectors are based on simple MuLV vectors. See, e.g., Miller et al. (1990) Mol Cell Biol 10:4239 (1990); Kolberg (1992) J NIH Res 4:43, and Cornetta et al. (1991) Hum Gene Ther 2:215).

The basic construction of recombinant viral vectors is well understood in the art and involves using standard molecular biological techniques such as those described in, e.g., Sambrook et al., MOLECULAR CLONING: A LABORATORY MANUAL (Cold Spring Harbor Press 1989) and the third edition thereof (2001), Ausubel et al., CURRENT PROTOCOLS IN MOLECULAR BIOLOGY (Wiley Interscience Publishers 1995), and Watson et al., RECOMBINANT DNA, (2d ed.), and several of the other references mentioned herein, all of which are incorporated herein by reference in their entirety for all purposes. For example, adenoviral vectors can be constructed and/or purified using the methods set forth, for example, in Graham et al., Mol. Biotechnol., 33(3), 207-220 (1995), U.S. Pat. No. 5,965,358, Donthine et al., Gene Ther., 7(20), 1707-14 (2000), and other references described herein. Adeno-associated viral vectors can be constructed and/or purified using the methods set forth, for example, in U.S. Pat. No. 4,797,368 and Laughlin et al., Gene, 23, 65-73 (1983). Similar techniques are known in the art with respect to other viral vectors, particularly with respect to herpes viral vectors (see e.g., Lachman et al., Curr. Opin. Mol. Ther., 1(5), 622-32 (1999)), lentiviral vectors, and other retroviral vectors. In general, the viral vector comprises an insertion of the nucleic acid (for example, a wild-type adenoviral vector can comprise an insertion of up to 3 KB without deletion), or, more typically, comprises one or more deletions of the virus genome to accommodate insertion of the nucleic acid and additional nucleic acids, as desired, and to prevent replication in host cells.

In one aspect, the viral vector desirably is a targeted viral vector, comprising a restricted or expanded tropism as compared to a wild-type viral particle of similar type. Targeting is typically accomplished by modification of capsid and/or envelope proteins of the virus particle. Examples of targeted virus vectors and related principles are described in, e.g., International Patent Applications WO 92/06180, WO 94/10323, WO 97/38723, and WO 01/28569, and WO 00/11201, Engelstadter et al., Gene Ther., 8(15), 1202-6 (2001), van Beusechem et al., Gene Ther., 7(22), 1940-6 (2000), Boerger et al., Proc. Natl. Acad. Sci. USA, 96(17), 9867-72 (1999), Bartlett et al., Nat. Biotechnol., 17(2), 181-6 (1999), Girod et al., Nat. Med., 5(9), 1052-56 (as modified by the erratum in Nat. Med., 5(12), 1438) (1999), J Gene Med. 1999 September-October; 1(5):300-11, Karavanas et al. Crit. Rev Oncol Hematol. 1998 June; 28(1):7-30, Wickham et al., J. Virol., 71(10), 7663-9 (1997), Cripe et al., Cancer Res., 61(7), 2953-60 (2001), van Deutekom et al., J. Gene Med., 1(6), 393-9 (1999), McDonald et al., J. Gene Med., 1(2), 103-10 (1999), Peng, Curr Opin Biotechnol. 1999 October; 10(5):454-7, Staba et al., Cancer Gene Ther., 7(1), 13-9 (2000), Kibbe et al., Arch. Surg., 135(2), 191-7 (2000), Harari et al., Gene Ther., 6(5), 801-7 (2000), and Bouri et al., Hum Gene Ther., 10(10), 1633-40 (1999), and Laquerre et al., J. Virol., 72(12), 9683-97 (1997), Buchholz Curr Opin Mol. Ther. 1999 October; 1(5):613-21, U.S. Pat. Nos. 6,261,554, 5,962,274, 5,695,991, and 6,251,654, and European Patent Appns 1 002 119 and 1 038 967. Particular targeted vectors and techniques for producing such vectors are provided in Int'l Patent Appn WO 99/23107.

The viral vector particle can be a chimeric viral vector particle. Examples of chimeric viral vector particles are described in, e.g., Reynolds et al., Mol. Med. Today, 5(1), 25-31 (1999), Boursnell et al., Gene, 13, 311-317 (1991), Dobbe et al., Virology, 288(2), 283-94 (2001), Grene et al., AIDS Res. Human. Retroviruses, 13(1), 41-51 (1997), Reimann et al., J. Virol., 70(10), 6922-8 (1996), Li et al., J. Virol., 67(11), 6659-66 (1993), Dong et al., J. Virol., 66(12), 7374-82 (1992), Wahlfors, Hum Gene Ther. 1999 May 1; 10(7):1197-206, Reynolds et al., Mol. Med. Today, 5(1), 25-31 (1999), Boursnell et al., Gene, 13, 311-317 (1991) and U.S. Pat. Nos. 5,877,011, 6,183,753, 6,146,643, and 6,025,341.

Non-viral vectors of the invention also can be associated with molecules that target the vector to a particular region in the host (e.g., a particular organ, tissue, and/or cell type). For example, a nucleotide can be conjugated to a targeting protein, such as a viral protein that binds a receptor or a protein that binds a receptor of a particular target (e.g., by a modification of the techniques provided in Wu and Wu, J. Biol. Chem., 263(29), 14621-24 (1988)). Targeted cationic lipid compositions also are known in the art (see, e.g., U.S. Pat. No. 6,120,799). Other techniques for targeting genetic constructs are provided in International Patent Application WO 99/41402.

In a further embodiment, the present invention provides host cells comprising one or more of any of the above-described nucleic acids, vectors, polypeptides, antibodies, fusion proteins, or other constructs of the invention, or any combination of one or more of these. The host cell can be a eukaryotic cell, such as a mammalian cell, a yeast cell, or a plant cell, or the host cell can be a prokaryotic cell, such as a bacterial cell. Introduction of the construct into the host cell can be effected by calcium phosphate transfection, DEAE-Dextran mediated transfection, electroporation, gene or vaccine gun, injection, or other common techniques (see, e.g., Davis, L., Dibner, M., and Battey, I. (1986) BASIC METHODS IN MOLECULAR BIOLOGY) for in vivo, ex vivo, and in vitro methods.

A host cell strain is optionally chosen for its ability to modulate the expression of the inserted sequences or to process the expressed protein in the desired fashion. Such modifications of the protein include, but are not limited to, acetylation, carboxylation, glycosylation, phosphorylation, lipidation and acylation. Post-translational processing that cleaves a “pre” or a “prepro” form of the protein may also be important for correct insertion, folding and/or function. Different host cells such as E. coli, Bacillus sp., yeast or mammalian cells such as CHO, HeLa, BHK, MDCK, HEK 293, WI38, etc. have specific cellular machinery and characteristic mechanisms for such post-translational activities and may be chosen to ensure the correct modification and processing of the introduced foreign protein.

A nucleic acid of the invention can be inserted into an appropriate host cell (in culture or in a host organism) to permit the host to express the protein. Any suitable host cell can be used transformed/transduced by the nucleic acids of the invention. Examples of appropriate expression hosts include: bacterial cells, such as E. coli, Streptomyces, Bacillus sp., and Salmonella typhimurium; fungal cells, such as Saccharomyces cerevisiae, Pichia pastoris, and Neurospora crassa; insect cells such as Drosophila and Spodoptera frugiperda; mammalian cells such as Vero cells, HeLa cells, CHO cells, COS cells, WI38 cells, NIH-3T3 cells (and other fibroblast cells, such as MRC-5 cells), MDCK cells, KB cells, SW-13 cells, MCF7 cells, BHK cells, HEK-293 cells, Bowes melanoma cells, and plant cells, etc. It is understood that not all cells or cell lines need to be capable of producing fully functional polypeptides or fragments thereof; for example, antigenic fragments of the polypeptide may be produced in a bacterial or other expression system. Additional examples of suitable cells are described, for example, in U.S. Pat. No. 5,994,106 and Int'l Patent Application WO 95/34671.

The present invention also provides host cells that are transduced, transformed or transfected with vectors of the invention. A vector of the invention typically comprises a nucleic acid of the invention (e.g., recombinant PRM15 or C15 signal peptide-encoding nucleic acid, tE polypeptide-encoding nucleic acid, full E polypeptide-encoding nucleic acid, PRM15/tE polypeptide-encoding nucleic acid, C15/full prM/full E-encoding nucleic acid). Host cells are genetically engineered (e.g., transduced, transformed or transfected) with the vectors of this invention, which may be, for example, a cloning vector or an expression vector. The vector may be, for example, in the form of a plasmid, a viral particle, a phage, etc. Cells suitable for transduction and/or infection with viral vectors of the invention for production of the recombinant polypeptides of the invention and/or for replication of the viral vector of the invention include the above-described cells.

Examples of cells that have been demonstrated as suitable for packaging of viral vector particles are described in, e.g., Inoue et al., J. Virol., 72(9), 7024-31 (1998), Polo et al., Proc. Natl. Acad. Sci., 96(8), 4598-603 (1999), Farson et al., J. Gene Med., 1(3), 195-209 (1999), Sheridan et al., Mol. Ther., 2(3), 262-75 (2000), Chen et al., Gene Ther., 8(9), 697-703 (2001), and Pizzaro et al., Gene Ther., 8(10), 737-745 (2001). For replication-deficient viral vectors, such as AAV vectors, complementing cell lines, or cell lines transformed with helper viruses, or cell lines transformed with plasmids encoding essential genes, are necessary for replication of the viral vector.

The engineered host cells can be cultured in conventional nutrient media modified as appropriate for activating promoters, selecting transformants, or amplifying the gene of interest. The culture conditions, such as temperature, pH, and the like, are those previously used with the host cell selected for expression, and will be apparent to those skilled in the art and in the references cited herein, including, e.g., ANIMAL CELL TECHNOLOGY, Rhiel et al., eds., (Kluwer Academic Publishers 1999), Chaubard et al., Genetic Eng. News, 20(18) (2000), Hu et al., ASM News, 59, 65-68 (1993), Hu et al., Biotechnol. Prog., 1, 209-215 (1985), Martin et al., Biotechnol., (1987), Freshney, CULTURE OF ANIMAL CELLS: A MANUAL OF BASIC TECHNIQUE, 4^(TH) ED., (Wiley, 2000), Mather, INTRODUCTION TO CELL AND TISSUE CULTURE: THEORY AND TECHNIQUE, (Plenum Press, 1998), Freshney, CULTURE OF IMMORTALIZED CELLS, 3^(RD) ED., (John Wiley & Sons, 1996), CELL CULTURE: ESSENTIAL TECHNIQUES, Doyle et al., eds. (John Wiley & Sons 1998), and GENERAL TECHNIQUES OF CELL CULTURE, Harrison et al., eds., (Cambridge Univ. Press 1997). The nucleic acid also can be contained, replicated, and/or expressed in plant cells. Techniques related to the culture of plant cells are described in, e.g., Payne et al. (1992) PLANT CELL AND TISSUE CULTURE IN LIQUID SYSTEMS John Wiley & Sons, Inc. New York, N.Y.; Gamborg and Phillips (eds.) (1995) PLANT CELL, TISSUE AND ORGAN CULTURE: FUNDAMENTAL METHODS Springer Lab Manual, Springer-Verlag (Berlin Heidelberg New York) and PLANT MOLECULAR BIOLOGY (1993) R. R. D. Croy (ed.) Bios Scientific Publishers, Oxford, U.K. ISBN 0 12 198370 6. Cell culture media in general are set forth in Atlas and Parks (eds.) THE HANDBOOK OF MICROBIOLOGICAL MEDIA (1993) CRC Press, Boca Raton, Fla.

For long-term, high-yield production of recombinant proteins, stable expression can be used. For example, cell lines that stably express a polypeptide of the invention are transduced using expression vectors which comprise viral origins of replication or endogenous expression elements and a selectable marker gene. Following the introduction of the vector, cells may be allowed to grow for 1-2 days in an enriched media before they are switched to selective media. The purpose of the selectable marker is to confer resistance to selection, and its presence allows growth and recovery of cells that successfully express the introduced sequences. For example, resistant clumps of stably transformed cells can be proliferated using tissue culture techniques appropriate to the cell type.

Host cells transformed with an expression vector and/or polynucleotide are optionally cultured under conditions suitable for the expression and recovery of the encoded protein from cell culture. The polypeptide or fragment thereof produced by such a recombinant cell may be secreted, membrane-bound, or contained intracellularly, depending on the sequence and/or the vector used. Expression vectors comprising polynucleotides encoding mature polypeptides of the invention can be designed with signal sequences that direct secretion of the mature polypeptides through a prokaryotic or eukaryotic cell membrane. Principles related to such signal sequences are discussed elsewhere herein.

Cell-free transcription/translation systems can also be employed to produce recombinant polypeptides of the invention or fragments thereof using DNAs and/or RNAs of the present invention or fragments thereof. Several such systems are commercially available. A general guide to in vitro transcription and translation protocols is found in Tymms (1995) IN VITRO TRANSCRIPTION AND TRANSLATION PROTOCOLS: METHODS IN MOLECULAR BIOLOGY Volume 37, Garland Publishing, NY.

The invention further provides a composition comprising at least one polypeptide of the invention, at least one vector of the invention, at least one nucleic acid of the invention, at least one cell of the invention, at least one antibody of the invention, or any combination thereof and a carrier, excipient, or diluent. Such compositions can comprise any suitable amount of any suitable number of polypeptides, fusion proteins, nucleic acids, vectors, and/or cells of the invention. Also provided are pharmaceutical compositions comprising at least one polypeptide, vector, nucleic acid, cell, antibody of the invention, or any combination thereof and a pharmaceutically acceptable carrier, excipient, or diluent.

For example, in one embodiment, the invention provides composition that comprises an excipient or carrier and a plurality of more recombinant polypeptides of the invention (e.g., two, three, four, or more recombinant polypeptide), wherein the composition induces a humoral and/or T cell immune response(s) against at least one flavivirus of at least one serotype (e.g., at least one dengue of at least one dengue virus serotype) in a subject, such as a mammal. Corresponding pharmaceutical compositions comprising a pharmaceutically acceptable excipient or carrier are also provided.

In another aspect, the invention provides compositions (including pharmaceutical compositions) that comprise an excipient or carrier (or pharmaceutically acceptable excipient or carrier) and a plurality of more dengue antigens (e.g., two, three, four, or more antigens), wherein at least one of the antigen is a recombinant polypeptide of the invention and the composition induces a humoral and/or T cell immune response(s) against at least one dengue virus of at least one serotype in a subject, such as a mammal. More preferably, the combined dengue antigen composition induces a protective immune response(s) against one or more dengue viruses of at least two, three, or all four virus serotypes in a subject.

In another aspect, the invention provides a composition (or pharmaceutical composition) comprising: (1) an excipient or carrier (or pharmaceutically acceptable excipient or carrier); (2) a polynucleotide comprising a nucleic acid sequence, that when expressed in a subject (e.g., mammal), produces a recombinant polypeptide of the invention); and (3) at least one additional nucleic acid sequence encoding an WT or recombinant dengue virus antigen and/or at least one WT or recombinant dengue virus antigen polypeptide. Such a recombinant or WT dengue virus antigen may in the form of, e.g., a WT or recombinant tE polypeptide, full E polypeptide, PRM15/tE polypeptide, C15/full prM/full E polypeptide, PRM15/full E polypeptide, or C15/full prM/tE polypeptide. Such composition induces a humoral and/or T cell response(s), and preferably a protective immune response against one or more dengue viruses of multiple virus serotypes in a subject. In such compositions, the recombinant nucleic acid of the invention, which encodes a recombinant polypeptide of the invention, and the nucleic acid sequence encoding at least one additional dengue virus antigen(s) can be in the same polynucleotide, or located on two or more different or separate polynucleotides, and, if desired, the various polynucleotide sequences can be isolated or positioned in one or more suitable vectors. In such compositions, the at least one additional dengue antigen co-administered and/or co-expressed with the recombinant polypeptide of the invention can be a recombinant polypeptide of the invention, a naturally occurring WT dengue virus antigen, or a known variant of a naturally occurring dengue virus antigen (e.g., a hybrid DEN-2/DEN-3 envelope as described in Bielefeldt-Ohmann et al., J. Gen Virol 78(11):2723-2733 (1997)). In one aspect, the at least one additional dengue antigen comprises at least one naturally occurring epitope (e.g., T cell epitope), such that the composition induces an immune response (e.g., T cell response) that is essentially equivalent to the immune response (e.g., T cell response) induced by a corresponding WT dengue virus antigen of the same or similar format (e.g., WT tE polypeptide, full E polypeptide, PRM15/tE polypeptide, C15/full prM/full E polypeptide, PRM15/full E polypeptide, or C15/full prM/tE polypeptide). Usually, in all of the analyses described throughout, for proper comparison, a recombinant dengue virus antigen is compared with WT dengue virus antigen having the same or a substantially similar format and/or size (e.g., a recombinant PRM15/tE polypeptide is compared with a WT PRM15/tE polypeptide). Similarly, a nucleic acid encoding a recombinant dengue virus antigen of a particular format and/or size is compared with a nucleic acid encoding a WT dengue virus antigen having the same or substantially similar format or size.

In one aspect, a composition of the invention includes a polynucleotide comprising a first nucleic acid sequence encoding a polypeptide of the invention that induces a neutralizing antibody response against at least one dengue virus of each of at least two dengue virus serotypes (and preferably against one or more dengue viruses of all four serotypes) and a plurality of additional nucleic acid sequences encoding peptides comprising known virus epitopes (e.g., T cell epitopes) from DEN-1, DEN-2, DEN-3, and/or DEN-4.

Additionally or alternatively, the composition can include one or more polypeptides of the invention selected for the retention of at least one known wild-type dengue epitope (e.g., T cell epitope). Such wild-type dengue epitopes are known in the art and include, e.g., the regions of the DEN-2 envelope protein comprising from about amino acid residues 35-50, 59-78, 135-157, 145-169, 240-250, 270-298, 295-307, 335-354, and/or 356-376 (see, e.g., Rothman et al., J Virol 70(10):6540-6546 (1996), Leclerc et al. Mol Immunol 30(7):613-625 (1993), and Roehrig et al. Virology 191(1):31-38 (1994)). Additionally, epitopes (e.g., T cell epitopes) within dengue E and prM proteins can be identified by epitope analysis (e.g., T cell epitope analysis) by one of ordinary skill using programs and algorithms known in the art, examples of which are further described herein, and by subsequent sequence comparison to identify polypeptide(s) of the invention that retain such identified epitopes for addition to the composition.

Desirably, a pharmaceutical composition of the invention comprises a pharmaceutically acceptable excipient or carrier and an antigenic or immunogenic amount of at least one recombinant polypeptide, polynucleotide, or vector of the invention (or a combination of any of these) sufficient to induce a immune response to at least one flavivirus (e.g., a dengue virus) of at least one serotype in a subject to which the pharmaceutical composition is administered in vivo or via ex vivo methods. In one particular aspect, the an amount of at least recombinant polypeptide, polynucleotide, or vector of the invention in the pharmaceutical composition is sufficient to induce a protective immune response in a subject to which the pharmaceutical composition is administered; that is, the amount is sufficient to protect against infection by the at least one flavivirus of at least one serotype. In another aspect, the pharmaceutical composition comprises a pharmaceutically acceptable excipient or carrier and an antigenic or immunogenic amount of at least one recombinant polypeptide, polynucleotide, or vector of the invention (or a combination of any of these) sufficient to induce a immune response in a subject to which the composition is delivered against at least one dengue virus of at least two, at least three, or four virus serotypes.

The composition (or pharmaceutical composition) can be any non-toxic composition that does not interfere with the immunogenicity of the at least one polypeptide, polynucleotide, or vector of the invention included therein. The composition can comprise one or more excipients or carriers, and the pharmaceutical composition comprises one or more pharmaceutically acceptable carriers. A wide variety of acceptable carriers, diluents, and excipients are known in the art. There are a wide variety of suitable formulations of compositions and pharmaceutical compositions of the present invention. For example, a variety of aqueous carriers can be used, e.g., buffered saline, such as phosphate-buffered saline (PBS), and the like are advantageous in injectable formulations of the polypeptide, polynucleotide, and/or vector of the invention. These solutions are preferably sterile and generally free of undesirable matter. These compositions may be sterilized by conventional, well known sterilization techniques. The compositions may comprise pharmaceutically acceptable auxiliary substances as required to approximate physiological conditions, such as, e.g., pH adjusting and buffering agents, toxicity adjusting agents and the like, for example, sodium acetate, sodium chloride, potassium chloride, calcium chloride, sodium lactate and the like. Any suitable carrier can be used in the administration of the polynucleotide, polypeptide, and/or vector of the invention, and several carriers for administration of therapeutic proteins are known in the art.

The composition, pharmaceutical composition and/or pharmaceutically acceptable carrier also can include diluents, fillers, salts, buffers, detergents (e.g., a nonionic detergent, such as Tween-80), stabilizers, stabilizers (e.g., sugars or protein-free amino acids), preservants, tissue fixatives, solubilizers, and/or other materials suitable for inclusion in a pharmaceutically composition. Examples of suitable components of the pharmaceutical composition in this respect are described in, e.g., Berge et al., J. Pharm. Sci., 66(1), 1-19 (1977), Wang and Hanson, J. Parenteral. Sci. Tech., 42, S4-S6 (1988), U.S. Pat. Nos. 6,165,779 and 6,225,289, and elsewhere herein. The pharmaceutical composition also can include preservatives, antioxidants, or other additives known to those of skill in the art. Additional pharmaceutically acceptable carriers are known in the art. Examples of additional suitable carriers are described in, e.g., Urquhart et al., Lancet, 16, 367 (1980), Lieberman et al., PHARMACEUTICAL DOSAGE FORMS—DISPERSE SYSTEMS (2nd ed., vol. 3, 1998), Ansel et al., PHARMACEUTICAL DOSAGE FORMS & DRUG DELIVERY SYSTEMS (7th ed. 2000), Martindale, THE EXTRA PHARMACOPEIA (31st edition), Remington's PHARMACEUTICAL SCIENCES (16th-20th editions), THE PHARMACOLOGICAL BASIS OF THERAPEUTICS, Goodman and Gilman, Eds. (9th ed.—1996), WILSON AND GISVOLDS TEXTBOOK OF ORGANIC MEDICINAL AND PHARMACEUTICAL CHEMISTRY, Delgado and Remers, Eds. (10th ed.—1998), and U.S. Pat. Nos. 5,708,025 and 5,994,106. Principles of formulating pharmaceutically acceptable compositions are described in, e.g., Platt, Clin. Lab Med., 7, 289-99 (1987), Aulton, PHARMACEUTICS: THE SCIENCE OF DOSAGE FORM DESIGN, Churchill Livingstone (New York) (1988), EXTEMPORANEOUS ORAL LIQUID DOSAGE PREPARATIONS, CSHP (1998), and “Drug Dosage,” J. Kans. Med. Soc., 70(1), 30-32 (1969). Additional pharmaceutically acceptable carriers particularly suitable for administration of vectors are described in, for example, International Patent Application WO 98/32859.

The composition or pharmaceutical composition of the invention can comprise or be in the form of a liposome. Suitable lipids for liposomal formulation include, without limitation, monoglycerides, diglycerides, sulfatides, lysolecithin, phospholipids, saponin, bile acids, and the like. Preparation of such liposomal formulations is described in, e.g., U.S. Pat. Nos. 4,837,028 and 4,737,323.

The form of the compositions or pharmaceutical composition can be dictated, at least in part, by the route of administration of the polypeptide, polynucleotide, cell, and/or vector of interest. Because numerous routes of administration are possible, the form of the pharmaceutical composition and/or components thereof can vary. For example, in transmucosal or transdermal administration, penetrants appropriate to the barrier to be permeated are preferably included in the composition. Such penetrants are generally known in the art, and include, for example, for transmucosal administration, detergents, bile salts, and fusidic acid derivatives. In contrast, in transmucosal administration can be facilitated through the use of nasal sprays or suppositories.

A common administration form for compositions, including pharmaceutical compositions, comprising the polypeptides and/or polynucleotides of the invention is by injection. Injectable pharmaceutically acceptable compositions comprise one or more suitable liquid carriers such as water, petroleum, physiological saline, bacteriostatic water, Cremophor ELTM (BASF, Parsippany, N.J.), phosphate buffered saline (PBS), or oils. Liquid pharmaceutical compositions can further include physiological saline solution, dextrose (or other saccharide solution), polyols, or glycols, such as ethylene glycol, propylene glycol, PEG, coating agents which promote proper fluidity, such as lecithin, isotonic agents, such as mannitol or sorbitol, organic esters such as ethyoleate, and absorption-delaying agents, such as aluminum monostearate and gelatins. Preferably, the injectable composition is in the form of a pyrogen-free, stable, aqueous solution. Preferably, the injectable aqueous solution comprises an isotonic vehicle such as sodium chloride, Ringer's injection solution, dextrose, lactated Ringer's injection solution, or an equivalent delivery vehicle (e.g., sodium chloride/dextrose injection solution). Formulations suitable for injection by intraarticular (in the joints), intravenous, intramuscular, intradermal, subdermal, intraperitoneal, and subcutaneous routes, include aqueous and non-aqueous, isotonic sterile injection solutions, which can include antioxidants, buffers, bacteriostats, and solutes that render the formulation isotonic with the blood of the intended recipient (e.g., PBS and/or saline solutions, such as 0.1 M NaCl), and aqueous and non-aqueous sterile suspensions that can include suspending agents, solubilizers, thickening agents, stabilizers, and preservatives.

The administration of a polypeptide, polynucleotide, or vector of the invention can be facilitated by a delivery device be formed of any suitable material. Examples of suitable matrix materials for producing non-biodegradable administration devices include hydroxapatite, bioglass, aluminates, or other ceramics. In some applications, a sequestering agent, such as carboxymethylcellulose (CMC), methylcellulose, or hydroxypropylmethylcellulose (HPMC), can be used to bind the polypeptide, polynucleotide, or vector to the device for localized delivery.

In another aspect, a polynucleotide or vector of the invention can be formulated with one or more poloxamers, polyoxyethylene/polyoxypropylene block copolymers, or other surfactants or soap-like lipophilic substances for delivery of the polynucleotide or vector to a population of cells or tissue or skin of a subject in vivo, ex vivo, or in in vitro systems. See e.g., U.S. Pat. Nos. 6,149,922, 6,086,899, and 5,990,241, each of which is incorporated herein by reference in its entirety for all purposes.

Vectors and polynucleotides of the invention can be desirably associated with one or more transfection-enhancing agents. In some embodiments, a nucleic acid and/or nucleic acid vector of the invention typically is associated with stability-promoting salts, carriers (e.g., PEG), and/or formulations that aid in transfection (e.g., sodium phosphate salts, dextran carriers, iron oxide carriers, or biolistic delivery (“gene gun”) carriers, such as gold bead or powder carriers) (see, e.g., U.S. Pat. No. 4,945,050). Additional transfection-enhancing agents include viral particles to which the nucleic acid/nucleic acid vector can be conjugated, a calcium phosphate precipitating agent, a protease, a lipase, a bipuvicaine solution, a saponin, a lipid (preferably a charged lipid), a liposome (preferably a cationic liposome, examples of which are described elsewhere herein), a transfection facilitating peptide or protein-complex (e.g., a poly(ethylenimine), polylysine, or viral protein-nucleic acid complex), a virosome, or a modified cell or cell-like structure (e.g., a fusion cell).

Nucleic acids of the invention can also be delivered by in vivo or ex vivo electroporation methods, including, e.g., those described in U.S. Pat. Nos. 6,110,161 and 6,261,281 and Widera et al., J. of Immunol. 164: 4635-4640 (2000), each of which is incorporated herein by reference in its entirety for all purposes.

Transdermal administration of at least one recombinant polypeptide, polynucleotide, and/or vector of the invention can be facilitated by a transdermal patch comprising the at least one polypeptide, polynucleotide, and/or vector in any suitable composition in any suitable form. Such transdermal patch devices are provided by the invention. For example, the at least one polypeptide, polynucleotide, and/or vector can be contained in a liquid reservoir in a drug reservoir patch device, or, alternatively, the polypeptide and/or polynucleotide can be dispersed throughout a material suitable for incorporation in a simple monolithic transdermal patch device. Typically, the patch comprises an immunogenic or antigenic amount of the polypeptide. Examples of such patch devices are known in the art. The patch device can be either a passive device or a device capable of iontophoretic delivery of the at least polypeptide, polynucleotide, and/or vector to the skin or tissue of the subject. Methods of promoting immunity to at least one dengue virus of at least one serotype in a subject comprise administering such a transdermal patch to the skin of the subject for a period of time and under conditions sufficient to promote immunity to the at least one dengue virus.

The composition, particularly the pharmaceutical composition, desirably comprises an amount of at least one polynucleotide, polypeptide, and/or vector in a dose sufficient to induce a protective immune response in a subject, preferably a human, upon administration. The composition can comprise any suitable dose of the at least one polypeptide, polynucleotide, and/or vector. Proper dosage can be determined by any suitable technique. In a simple dosage testing regimen, low doses of the composition are administered to a test subject or system (e.g., an animal model, cell-free system, or whole cell assay system). Considerations in dosing for immunogenic polypeptide, polynucleotide, and/or vector compositions (as well as for gene transfer by viral vectors) are known in the art. Briefly, dosage is commonly determined by the efficacy of the particular nucleic acid, polypeptide, and/or vector, the condition of the patient, as well as the body weight and/or target area of the patient to be treated. The size of the dose is also determined by the existence, nature, and extent of any adverse side-effects that accompany the administration of any such particular polypeptide, nucleic acid, vector, formulation, composition, transduced cell, cell type, or the like in a particular patient. Principles related to dosage of therapeutic and prophylactic agents are provided in, e.g., Platt, Clin. Lab Med., 7, 289-99 (1987), “Drug Dosage,” J. Kans. Med. Soc., 70(1), 30-32 (1969), and other references described herein (e.g., Remington's, supra).

Typically, a nucleic acid composition of the invention comprises from about 1 μg to about 10 mg of at least one nucleic acid of the invention, including about 1 μg to about 15 mg, including about 1 μg to about 10 mg, about 500 μg to about 10 mg, about 500 μg to about 5 mg, about 1 mg to about 5 mg, about 2 mg to about 5 mg, about 1 μg to about 2 mg, including about 1 μg to about 1 mg, about 1 μg to about 500 μg, 1 μg to about 100 μg, 1 μg to about 50 μg, and 1 μg to about 10 μg of the nucleic acid. For delivery of a vector comprising a nucleic acid of the invention, the same amount(s) can be administered. In one aspect, the composition to be administered to a host comprises about 1, 2, 5, or 10 mg of a nucleic acid or vector of the invention. A mixture of two or more nucleic acids of the invention (or mixture of two or more vectors, each encoding a nucleic acid of the invention) can be administered in such amount(s). The volume of carrier or diluent in which such nucleic acid is administered depends upon the amount of nucleic acid to be administered. For example; 2 mg nucleic acid is typically administered in a 1 mL volume of carrier or diluent. The amount of nucleic acid in the composition depends on the host to which the nucleic acid composition is to be administered, the characteristics of the nucleic acid (e.g., gene expression level as determined by the encoded peptide, codon optimization, and/or promoter profile), and the form of administration. For example, biolistic or “gene gun” delivery methods of as little as about 1 μg of nucleic acid dispersed in or on suitable particles is effective for inducing an immune response even in large mammals such as humans. In some instances, biolistic delivery of at least about 5 μg, more preferably at least about 10 μg, or more nucleic acid may be desirable. Biolistic delivery of nucleic acids is discussed further elsewhere herein.

For injection of a nucleic acid composition, a larger dose of nucleic acid typically will be desirable. In general, an injectable nucleic acid composition comprises at least about 1 μg nucleic acid, typically about 5 μg nucleic acid, more typically at least about 25 μg of nucleic acid or at least about 30 μg of nucleic acid, 50 μg of nucleic acid, usually at least about 75 μg or at least about 80 μg of the nucleic acid, preferably at least about 100 μg or at least about 150 μg nucleic acid, preferably at least about 500 μg, at least about 1 mg, at least about 2 mg nucleic acid, at least about 5 mg nucleic acid, or more. In some instances, the injectable nucleic acid composition may comprise about 0.25-5 mg of the nucleic acid, typically in a volume of diluent, carrier, or excipient of about 0.5-1 mL. Commonly, an injectable nucleic acid solution comprises about 0.5 mg, about 1 mg, 1.5 mg, or even about 2 mg nucleic acid, usually in a volume of about 0.25 mL, about 0.5 mL, 0.75 mL, or about 1 mL. In one aspect, 2 mg nucleic acid is typically administered in a 1 mL volume of carrier, diluent, or excipient (e.g., PBS or saline). However, in some instances, lower injectable doses (e.g., less than about 5 μg, such as, e.g., about 4 μg, about 3, about 2 μg, or about 1 μg) of the polynucleotide of the invention are about equally or more effective in producing an antibody response than the above-described higher doses.

A viral vector composition of the invention can comprise any suitable number of viral vector particles. The dosage of viral vector particles or viral vector particle-encoding nucleic acid depends on the type of viral vector particle with respect to origin of vector (e.g., whether the vector is an alphaviral vector, papillomaviral vector, HSV vector, and/or an AAV vector), whether the vector is a transgene expressing or recombinant peptide displaying vector, the host, and other considerations discussed above. Generally, with respect to gene transfer vectors, the pharmaceutically acceptable composition comprises at least about 1×10² viral vector particles in a volume of about 1 mL (e.g., at least about 1×10² to about 1×10⁸ particles in about 1 mL). Higher dosages also can be suitable (e.g., at least about 1×10⁶, about 1×10⁸, about 1×10⁹, about 1×10¹⁰, or more particles/mL).

Nucleic acid compositions of the invention can comprise additional nucleic acids. For example, a nucleic acid can be co-administered with a second immunostimulatory sequence or a second cytokines/adjuvant-encoding sequence (e.g., a sequence encoding an IFN-gamma and/or a GM-CSF). Examples of such sequences are described above.

The invention further provides a composition comprising a plurality of VLPs (of at least one type) formed from, e.g., recombinant C15/full prM/full E polypeptides or full prM/full E polypeptides of the invention. Desirably, the composition comprises a dose of VLPs sufficient to induce protective immunity in a subject, such as a mammalian host. Dosage considerations for VLPs are similar to those described above with respect to viral vector particles and other compositions of the invention.

The invention also provides a composition comprising an aggregate of two or more polypeptides of the invention. Moreover, the invention provides a composition comprising a population of one or more multimeric polypeptides of the invention. In particular, recombinant dengue antigens of the invention can form dimers (and in some instances trimers) in certain conditions and can retain such a multimeric state in a subject, e.g., mammalian host, as shown in the Examples below.

The invention further provides methods of making and using the polypeptides, polynucleotides, vectors, and cells of the invention. In one aspect, the invention provides a method of making a recombinant polypeptide of the invention by introducing a nucleic acid of the invention into a population of cells in a culture medium, culturing the cells in the medium (for a time and under conditions suitable for desired level of gene expression) to produce the polypeptide, and isolating the polypeptide from the cells, culture medium, or both. The polypeptide can be isolated from cell lysates, and/or cell culture medium by first concentrating the culture medium using centrifugal filters (Amicon), alternatively, by precipitating the polypeptides with ammonium sulfate or polyethylene glycol and then resuspending the polypeptides in PBS or other suitable buffers. The polypeptides can then be purified using either size-exclusion chromatography on Sephacryl S-400 column (Amersham Biosciences) as described in, e.g., Hjorth, R. and J. Moreno-Lopez. 1982, J. Virol. Methods 5:151-158, or another affinity chromatography, or by centrifugation through 20-60% sucrose gradients as described in, e.g., Konish, E., S. et al., 1992, Virology 188:714-720 (see FIGS. 15A-15B). Fractions containing the desired polypeptides can be identified by ELISA or SDS-PAGE followed by protein silver stain and immunoblotting. The desired fractions are pooled and further concentrated. Sucrose in gradient centrifugation fractions can be removed using PD-10 column (Amersham Biosciences) gel filtration. Additional purification techniques include hydrophobic interaction chromatography (Diogo, M. M, et al., 2001, J Gene Med. 3:577-584) or any other suitable technique known in the art. A variety of polypeptide purification methods are well known in the art, including those set forth in, e.g., Sandana (1997) BIOSEPARATION OF PROTEINS, Academic Press, Inc., Bollag et al. (1996) PROTEIN METHODS, 2^(nd) Edition Wiley-Liss, NY, Walker (1996) THE PROTEIN PROTOCOLS HANDBOOK Humana Press, NJ, Harris and Angal (1990) PROTEIN PURIFICATION APPLICATIONS: A PRACTICAL APPROACH IRL Press at Oxford, Oxford, England, Scopes (1993) PROTEIN PURIFICATION: PRINCIPLES AND PRACTICE 3^(rd) Edition Springer Verlag, NY, Janson and Ryden (1998) PROTEIN PURIFICATION: PRINCIPLES, HIGH RESOLUTION METHODS AND APPLICATIONS, Second Edition Wiley-VCH, NY; and Walker (1998) PROTEIN PROTOCOLS ON CD-ROM Humana Press, NJ. Cells suitable for polypeptide production are known in the art and are discussed elsewhere herein (e.g., Vero cells, 293 cells, BHK, CHO, and COS cells can be suitable). Cells can be lysed by any suitable technique including, e.g., sonication, microfluidization, physical shear, French press lysis, or detergent-based lysis. The invention provides a similar method of making a polypeptide of the invention comprising inserting a vector according to the invention to the cells, culturing the cells under appropriate conditions for expression of the nucleic acid from the vector, and isolating the polypeptide from the cells, culture medium, or both. The cells chosen are based on the desired processing of the polypeptide and based on the appropriate vector (e.g., E. coli cells can be preferred for bacterial plasmids, whereas 293 cells can be preferred for mammalian shuttle plasmids and/or adenoviruses, particularly E1-deficient adenoviruses).

In addition to recombinant production, the polypeptides may be produced by direct peptide synthesis using solid-phase techniques (see, e.g., Stewart et al. (1969) SOLID-PHASE PEPTIDE SYNTHESIS, WH Freeman Co, San Francisco and Merrifield J. (1963) J Am Chem Soc 85:2149-2154). Peptide synthesis may be performed using manual techniques or by automation. Automated synthesis may be achieved, for example, using Applied Biosystems 431A Peptide Synthesizer (Perkin Elmer, Foster City, Calif.) in accordance with the instructions provided by the manufacturer. For example, subsequences may be chemically synthesized separately and combined using chemical methods to produce a polypeptide of the invention or fragments thereof. Alternatively, synthesized polypeptides may be ordered from any number of companies that specialize in production of polypeptides. Most commonly, polypeptides of the invention are produced by expressing coding nucleic acids and recovering polypeptides, e.g., as described above.

In another aspect, the invention provides a method of producing a polypeptide of the invention comprising introducing a nucleic acid of the invention, a vector of the invention, or a combination thereof, into a subject, which typically and preferably is a mammal (e.g., a rat, a nonhuman primate, a bat, a marmoset, a pig, or a chicken), such that a polypeptide of the invention is expressed in the subject, and the polypeptide is isolated from the animal or from a byproduct of the subject. Isolation of the polypeptide from the subject, e.g., animal, or animal byproduct can be by any suitable technique, depending on the subject and desired recovery strategy. For example, the polypeptide can be recovered from sera of mice, monkeys, or pigs expressing the polypeptide of the invention. Transgenic animals (which preferably are mammals, such as the aforementioned mammals) comprising at least one nucleic acid of the invention also are provided. The transgenic animal can have the nucleic acid integrated into its host genome (e.g., by an AAV vector, lentiviral vector, biolistic techniques performed with integration-promoting sequences, etc.) or can have the nucleic acid in maintained epichromosomally (e.g., in a non-integrating plasmid vector or by insertion in a non-integrating viral vector). Epichromosomal vectors can be engineered for more transient gene expression than integrating vectors. RNA-based vectors offer particular advantages in this respect.

The invention additionally provides a method of producing at least one antibody that binds to at least a portion of a dengue virus. The invention further provides a method of producing at least one antibody that binds to at least one dengue virus of at least one serotype, preferably binds to one or more dengue viruses of each of two or three serotypes, and more preferably binds to one or more dengue viruses of all four virus serotypes, which comprises administering an effective amount (e.g., antigenic or immunogenic amount) of at least one recombinant polypeptide of the invention or an antigenic or immunogenic fragment thereof, or an effective amount of a vector or nucleic acid encoding such at least one polypeptide, or composition comprising an effective amount of such at least one polypeptide or nucleic acid or polynucleotide encoding said at least polypeptide, to a suitable animal host or host cell. The host cell is cultured or the animal host is maintained under conditions permissive for formation of antibody-antigen complexes. Subsequently produced antibodies are recovered from the cell culture, the animal, or a byproduct of the animal (e.g., sera from a mammal). The production of antibodies can be carried out with either at least one polypeptide of the invention, or a peptide or polypeptide fragment thereof comprising at least about 10 amino acids, preferably at least about 15 amino acids (e.g., about 20 amino acids), and more preferably at least about 25 amino acids (e.g., about 30 amino acids) or more in length. Alternatively, a nucleic acid or vector can be inserted into appropriate cells, which are cultured for a sufficient time and under periods suitable for transgene expression, such that a nucleic acid sequence of the invention is expressed therein resulting in the production of antibodies that bind to the recombinant antigen encoded by the nucleic acid sequence. Antibodies thereby obtained can have diagnostic and/or prophylactic uses. The provision of such antibodies, and compositions and pharmaceutical compositions comprising such antibodies (by use of the principles described above with respect to other compositions and pharmaceutically acceptable compositions) are features of the invention.

Antibodies produced in response to at least one polypeptide of the invention, fragment thereof, or the expression of such at least one polypeptide by a vector and/or polynucleotide of the invention can be any suitable type of antibody or antibodies. Antibodies provided by the invention include, e.g., polyclonal antibodies, monoclonal antibodies, chimeric antibodies, humanized antibodies, single chain antibodies, Fab fragments, and fragments produced by a Fab expression library. Methods of producing polyclonal and monoclonal antibodies are known to those of skill in the art, and many antibodies are available. See, e.g., CURRENT PROTOCOLS IN IMMUNOLOGY, John Colligan et al., eds., Vols. I-IV (John Wiley & Sons, Inc., NY, 1991 and 2001 Supplement), and Harlow and Lane (1989) ANTIBODIES: A LABORATORY MANUAL Cold Spring Harbor Press, NY, Stites et al. (eds.) BASIC AND CLINICAL IMMUNOLOGY (4th ed.) Lange Medical Publications, Los Altos, Calif., and references cited therein, Goding (1986) MONOCLONAL ANTIBODIES: PRINCIPLES AND PRACTICE (2d ed.) Academic Press, New York, N.Y., and Kohler and Milstein (1975) Nature 256:495-497. Other suitable techniques for antibody preparation include selection of libraries of recombinant antibodies in phage or similar vectors. See, Huse et al. (1989) Science 246:1275-1281; and Ward et al. (1989) Nature 341:544-546. Specific monoclonal and polyclonal antibodies and antisera will usually bind with a K_(D) of at least about 0.1 μM, preferably at least about 0.01 μM or better, and preferably, 0.001 μM or better.

Detailed methods for preparation of chimeric (humanized) antibodies can be found in U.S. Pat. No. 5,482,856. Additional details on humanization and other antibody production and engineering techniques can be found in Borrebaeck (ed.) (1995) ANTIBODY ENGINEERING, 2^(nd) Ed. Freeman and Co., NY (Borrebaeck); McCafferty et al. (1996) ANTIBODY ENGINEERING, A PRACTICAL APPROACH IRL at Oxford Press, Oxford, England (McCafferty), and Paul (1995) ANTIBODY ENGINEERING PROTOCOLS Humana Press, Towata, N.J. (Paul).

Humanized antibodies are especially desirable in applications where the antibodies are used as therapeutics and/or prophylactics in vivo in mammals (e.g., such as humans) and ex vivo in cells or tissues that are delivered to or transplanted into mammals (e.g., humans). Human antibodies consist of characteristically human immunoglobulin sequences. The human antibodies of this invention can be produced in using a wide variety of methods (see, e.g., Larrick et al., U.S. Pat. No. 5,001,065, and Borrebaeck McCafferty and Paul, supra, for a review). In one embodiment, the human antibodies of the present invention are produced initially in trioma cells. Genes encoding the antibodies are then cloned and expressed in other cells, such as nonhuman mammalian cells. The general approach for producing human antibodies by trioma technology is described by Ostberg et al. (1983), Hybridoma 2:361-367, Ostberg, U.S. Pat. No. 4,634,664, and Engelman et al., U.S. Pat. No. 4,634,666. The antibody-producing cell lines obtained by this method are called triomas because they are descended from three cells—two human and one mouse. Triomas have been found to produce antibody more stably than ordinary hybridomas made from human cells.

Additional useful techniques for preparing antibodies are described in, e.g., Gavilodono et al., Biotechniques 29(1):128-32, 134-6, and 138 (passim) (2000), Nelson et al., Mol. Pathol. 53(3): 111-7 (2000), Laurino et al. Ann. Clin. Lab. Sci. 29(3): 158-66 (1999), Rapley, Mol. Biotechnol. 3(2):139-54 (1995), Zaccolo et al., Int. J. Clin. Lab. Res. 23(4):192-8 (1993), Morrison, Annu. Rev. Immunol. 10:239-65 (1992), “Antibodies, Annigene, and Molecular Mimiery,” Meth. Enzymd. 178 (J. J. Langone, Ed. 1989), Moore, Clin. Chem., 35(9): 1849-53 (1989), Rosalki et al., Clin. Chim. Acta 183(1):45-58 (1989), and Tami et al., Am. J. Hosp. Pharm. 43(11):2816-25 (1986), as well as U.S. Pat. Nos. 4,022,878, 4,350,683, and 4,022,878. A technique for producing antibodies with remarkably high binding affinities is provided in Border et al., Proc. Natl. Acad. Sci., USA 97(20): 10701-05 (2000).

The invention further provides a method of promoting, inducing, enhancing or modulating, a mammal's immune response to at least one dengue virus of at least one serotype comprising administering an immunogenic amount of at least one polypeptide to a mammal, such as human and non-human primates, such that an immune response to the at least one dengue virus of the at least one serotype in the mammal is promoted, induced, enhanced, or modulated. Preferably, the polypeptide is administered in a pharmaceutical composition comprising the polypeptide of the invention and a pharmaceutically acceptable carrier or excipient as described above. Typically, an injectable, pharmaceutical composition comprising a suitable, pharmaceutically acceptable carrier (e.g., PBS) and an immunogenic amount of the polypeptide is delivered intramuscularly, intraperitoneally, subdermally, transdermally, subcutaneously, or intradermally to the host for in vivo. Alternatively, biolistic protein delivery techniques (vaccine gun delivery) can be used (examples of which are discussed elsewhere herein). Any other suitable technique also can be used. Polypeptide administration can be facilitated via liposomes (examples further discussed below).

The invention also provides a method promoting an immune response to a dengue virus in a subject by administering an antigenic or immunogenic amount of at least one nucleic acid of the invention and/or at least one nucleic acid vector (NAV) of the invention, preferably in a pharmaceutical composition comprising a pharmaceutically acceptable carrier and an antigenic immunogenic amount of the at least one nucleic acid and/or at least one nucleic acid vector, to the subject.

While the following discussion is primary directed to nucleic acids, it will be understood that it applies equally (and, indeed, preferably) to nucleic acid vectors of the invention. The nucleic acid composition can be administered or delivered to the host by any suitable administration route. In some aspects of the invention, administration of the nucleic acid is parenteral (e.g., subcutaneous, intramuscular, or intradermal), topical, or transdermal. The nucleic acid can be introduced directly into a tissue, such as muscle, by injection using a needle or other similar device. See, e.g., Nabel et al. (1990), supra); Wolff et al. (1990) Science, 247:1465-1468), Robbins (1996) GENE THERAPY PROTOCOLS, Humana Press, NJ, and Joyner (1993) GENE TARGETING: A PRACTICAL APPROACH, IRL Press, Oxford, England, and U.S. Pat. Nos. 5,580,859 and 5,589,466. Other methods such as “biolistic” or particle-mediated transformation (see, e.g., U.S. Pat. No. 4,945,050, U.S. Pat. No. 5,036,006, Sanford et al., J. Particulate Sci. Tech., 5, 27-37 (1987), Yang et al., Proc. Natl. Acad. Sci. USA, 87, 9568-72 (1990), and Williams et al., Proc. Natl. Acad. Sci. USA, 88, 2726-30 (1991)). These methods are useful not only for in vivo introduction of DNA into a subject, such as a mammal, but also for ex vivo modification of cells for reintroduction into a subject (which is discussed further elsewhere herein).

For standard gene gun administration, the vector or nucleic acid of interest is precipitated onto the surface of microscopic metal beads. The microprojectiles are accelerated with a shock wave or expanding helium gas, and penetrate tissues to a depth of several cell layers. For example, the Accel™ Gene Delivery Device manufactured by Agacetus, Inc. Middleton Wis. is suitable for use in this embodiment. The nucleic acid or vector can be delivered by such techniques, for example, intramuscularly, intradermally, subdermally, subcutaneously, and/or intraperitoneally. Additional devices and techniques related to biolistic delivery Int'l Patent Applications WO 99/2796, WO 99/08689, WO 99/04009, and WO 98/10750, and U.S. Pat. Nos. 5,525,510, 5,630,796, 5,865,796, and 6,010,478,

The nucleic acid can be delivered in association with a transfection-facilitating agent, examples of which were discussed above. The nucleic acid can be delivered topically and/or by liquid particle delivery (in contrast to solid particle biolistic delivery). Examples of such nucleic acid delivery techniques, compositions, and additional constructs that can be suitable as delivery vehicles for the nucleic acids of the invention are provided in, e.g., U.S. Pat. Nos. 5,591,601, 5,593,972, 5,679,647, 5,697,901, 5,698,436, 5,739,118, 5,770,580, 5,792,751, 5,804,566, 5,811,406, 5,817,637, 5,830,876, 5,830,877, 5,846,949, 5,849,719, 5,880,103, 5,922,687, 5,981,505, 6,087,341, 6,107,095, 6,110,898, and International Patent Applications WO 98/06863, WO 98/55495, and WO 99/57275, each of which is incorporated herein by reference in its entirety for all purposes.

The choice of delivery technique and form of the antigen can influence the type of immune response observed upon administration. For example, gene gun delivery of many antigens is associated with a Th2-biased response (indicated by higher IgG1 antibody titers and comparatively low IgG2a titers). Advantageously, at least some of the VLPs of the invention are expected to overcome this Th2-bias that can be observed with the administration of other dengue virus antigens. The bias of a particular immune response enables the physician or artisan to direct the immune response promoted by administration of the polypeptide and/or polynucleotide of the invention.

Alternatively, the nucleic acid can be delivered to the host by way of liposome-based gene delivery. Exemplary techniques and principles related to liposome-based gene delivery is provided in, e.g., Debs and Zhu (1993) WO 93/24640; Mannino and Gould-Fogerite (1988) BioTechniques 6(7):682-691; Rose U.S. Pat. No. 5,279,833; Brigham (1991) WO 91/06309; Brigham et al. (1989) Am J Med Sci 298:278-281; Nabel et al. (1990) Science 249:1285-1288; Hazinski et al. (1991) Am J Resp Cell Molec Biol 4:206-209; and Wang and Huang (1987) Proc Natl Acad Sci USA 84:7851-7855), and Felgner et al. (1987) Proc. Natl Acad. Sci. USA 84:7413-7414), each of which is incorporated herein by reference in its entirety for all purposes. Suitable liposome pharmaceutically acceptable compositions that can be used to deliver the nucleic acid are further described elsewhere herein.

Any immunogenic amount of nucleic acid can be used. Typically, where the nucleic acid is administered by injection, about 50 micrograms (ug) to 5 mg, usually about 100 ug to about 2.5 mg, typically about 500 μg to about 2 mg or about 800 μg to about 1.5 mg, and often about 2 mg or about 1 mg is administered.

The amount of DNA plasmid for use in these methods where administration is via a gene gun, e.g., typically is from about 100 to about 1000 times less than the amount used for direct injection. For example, for gene gun delivery, the amount of DNA plasmid corresponding to the first range above would be from about 50×10⁻⁸ g to 5×10⁻⁵ g (100 times less) or from about 50×10⁻⁹ to about 5×10⁻⁶ g. Despite such sensitivity, preferably at least about 1 μg of the nucleic acid is used in such biolistic delivery techniques.

The expression of the nucleic acid sequence encoding the recombinant dengue antigen can be operably linked to any suitable promoter and/or other expression controls sequences, examples of which were described above. For example, expression of the polynucleotide construct can be induced by using an inducible on- and off-gene expression system. Examples of such on- and off-gene expression systems include the Tet-On™ Gene Expression System and Tet-Off™ Gene Expression System (see, e.g., Clontech Catalog 2000, pg. 110-111 for a detailed description of each such system), respectively.

Delivery of a viral vector of the invention also can promote an immune response to at least one dengue virus of at least one serotype in a subject. Any suitable viral vector, in any suitable concentration, can be used to induce the immune response. For example, to the subject host can be administered a population of retroviral vectors (examples of which are described in, e.g., Buchscher et al. (1992) J. Virol. 66(5) 2731-2739, Johann et al. (1992) J. Virol. 66 (5): 1635-1640 (1992), Sommerfelt et al., (1990) Virol. 176:58-59, Wilson et al. (1989) J. Virol. 63:2374-2378, Miller et al., J. Virol. 65:2220-2224 (1991), Wong-Staal et al., PCT/US94/05700, Rosenburg and Fauci (1993) in FUNDAMENTAL IMMUNOLOGY, THIRD EDITION Paul (ed) Raven Press, Ltd., New York and the references therein), an AAV vector (as described in, e.g., West et al. (1987) Virology 160:38-47, Kotin (1994) Human Gene Therapy 5:793-801, Muzyczka (1994) J. Clin. Invst. 94:1351, Tratschin et al. (1985) Mol. Cell. Biol. 5(11):3251-3260, U.S. Pat. Nos. 4,797,368 and 5,173,414, and International Patent Application WO 93/24641), or an adenoviral vector (as described in, e.g., Berns et al. (1995) Ann. NY Acad. Sci. 772:95-104; Ali et al. (1994) Gene Ther. 1:367-384; and Haddada et al. (1995) Curr. Top. Microbiol. Immunol. 199 (Pt 3):297-306), such that immunogenic levels of expression of the nucleic acid included in the vector thereby occurs in vivo resulting in the desired immune response. Other suitable types of viral vectors are described elsewhere herein (including alternative examples of suitable retroviral, AAV, and adenoviral vectors).

Suitable infection conditions for these and other types of viral vector particles are described in, e.g., Bachrach et al., J. Virol., 74(18), 8480-6 (2000), Mackay et al., J. Virol., 19(2), 620-36 (1976), and FIELDS VIROLOGY, supra. Additional techniques useful in the production and application of viral vectors are provided in, e.g., “Practical Molecular Virology: Viral Vectors for Gene Expression” in METHODS IN MOLECULAR BIOLOGY, vol. 8, Collins, M. Ed., (Humana Press 1991), VIRAL VECTORS: BASIC SCIENCE AND GENE THERAPY, 1st Ed. (Cid-Arregui et al., Eds.) (Eaton Publishing 2000), “Viral Expression Vectors,” in CURRENT TOPICS IN MICROBIOLOGY AND IMMUNOLOGY, Oldstone et al., Eds. (Springer-Verlag, NY, 1992), and “Viral Vectors” in CURRENT COMMUNICATIONS IN BIOTECHNOLOGY, Gluzman and Hughes, Eds. (Cold Spring Harbor Laboratory Press, 1988).

The toxicity and therapeutic efficacy of the vectors that include recombinant molecules provided by the invention can be determined using standard pharmaceutical procedures in cell cultures or experimental animals. For example, the artisan can determine the LD₅₀ (the dose lethal to 50% of the population) and the ED₅₀ (the dose therapeutically effective in 50% of the population) using procedures presented herein and those otherwise known to those of skill in the art. Nucleic acids, polypeptides, proteins, fusion proteins, transduced cells and other formulations of the present invention can be administered at a rate determined, e.g., by the LD₅₀ of the formulation, and the side-effects thereof at various concentrations, as applied to the mass and overall health of the patient. Administration can be accomplished via single or divided doses.

The viral vector can be targeted to particular tissues, cells, and/or organs. Examples of such vectors are described above. E.g., the viral vector or nucleic acid vector can be used to selectively deliver the nucleic acid sequence of the invention to monocytes, dendritic cells, cells associated with dendritic cells (e.g., keratinocytes associated with Langerhans cells), T-cells, and/or B-cells. The viral vector can be a replication-deficient viral vector. The viral vector particle also can be modified to reduce host immune response to the viral vector, thereby achieving persistent gene expression. Such “stealth” vectors are described in, e.g., Martin, Exp. Mol. Pathol., 66(1):3-7 (1999), Croyle et al., J. Virol., 75(10): 4792-801 (2001), Rollins et al., Hum. Gene Ther., 7(5), 619-26 (1996), Ikeda et al., J. Virol., 74(10):4765-75 (2000), Halbert et al., J. Virol., 74(3), 1524-32 (2000), and Int'l Patent Appn WO 98/40509. Alternatively or additionally, the viral vector particles can be administered by a strategy selected to reduce host immune response to the vector particles. Strategies for reducing immune response to the viral vector particle upon administration to a host are provided in, e.g., Maione et al., Proc. Natl. Acad. Sci. USA, 98(11), 5986-91 (2001), Morral et al., Proc. Natl. Acad. Sci. USA, 96(22), 2816-21 (1999), Pastore et al., Hum. Gene Ther., 10(11), 1773-81 (1999), Morsy et al., Proc. Natl. Acad. Sci. USA, 95(14), 7866-71 (1998), Joos et al., Hum. Gene Ther., 7(13), 1555-66 (1996), Kass-Eisler et al., Gene Ther., 3(2), 154-62 (1996), U.S. Pat. Nos. 6,093,699, 6,211,160, 6,225,113, US Pat. Appn 2001-0066947A1.

Any suitable population and concentration (dosage) of viral vector particles can be used to induce the immune response in the subject host. In some aspects of the invention, at least about 1×10² particles are typically used (e.g., the method can comprises administering a composition comprising at least from about 1×10² particles/mL to about 1×10⁹ particles/mL of a suitable viral vector particle in about 1-2 mL injectable and pharmaceutically acceptable solution). When delivered to a host, the population of viral vector particles is such that the multiplicity of infection (MOI) desirably is at least from about 1 to about 100 and more preferably from at least about 5 to about 30. Considerations in viral vector particle dosing are described elsewhere herein.

The term “prime” generally refers to the administration or delivery of a polypeptide of the invention (e.g., recombinant dengue virus antigen) or a polynucleotide encoding such polypeptide to a cell culture or population of cells in vitro, or in vivo to a subject or ex vivo to tissue or cells of a subject. The first administration or delivery (primary contact) may not be sufficient to induce or promote a measurable response (e.g., antibody response), but may be sufficient to induce a memory response, or an enhanced secondary response. The term “challenge” generally refers to any procedure that induces, promotes, or modulates an immune response.

Preferably, the initial delivery or administration of a polypeptide or polynucleotide of the invention to cells or a cell culture in vitro, or in vivo or ex vivo to tissue or cells of a subject is followed by one or more secondary (usually repeat) administrations of the polynucleotide and/or polypeptide. For example, initial administration of a polypeptide composition can be followed, typically at least about 7 days after the initial polypeptide administration (more typically about 14-35 days or about 2, 4, 6, 12, or 24 months) after initial polypeptide administration), with a first repeat administration (“prime boost”) of a substantially similar (if not identical) dose of the polypeptide, typically in a similar amount as the first administration (e.g., about 5 μg to about 1 mg, or about 5 μg to 0.1 mg of polypeptide in a 1-2 mL injectable and pharmaceutically acceptable solution). Desirably, a second repeat administration (or “secondary boost”) is performed with a similar, if not identical, dose of the polypeptide composition at about 2-9, 3-6 months, 9-18 months, or about 12 or 24 months after the initial polypeptide administration.

Any technique comprising administering a polypeptide of the invention can also include the co-administration of one or more suitable adjuvants. Examples of suitable adjuvants include Freund's emulsified oil adjuvants (complete and incomplete), alum (aluminum hydroxide and/or aluminum phosphate), lipopolysaccharides (e.g., bacterial LPS), liposomes (including dried liposomes and cytokine-containing (e.g., IFN-γ-containing and/or GM-CSF-containing) liposomes), endotoxins, calcium phosphate and calcium compound microparticles (see, e.g., International Patent Application WO 00/46147), mycobacterial adjuvants, Arlacel A, mineral oil, emulsified peanut oil adjuvant (adjuvant 65), Bordetella pertussis products/toxins, Cholera toxins, non-ionic block polymer surfactants, Corynebacterium granulosum derived P40 component, fatty acids, aliphatic amines, paraffinic and vegetable oils, beryllium, and immunostimulating complexes (ISCOMs—reviewed in, e.g., Höglund et al. “ISCOMs and immunostimulation with viral antigens” in SUBCELLULAR BIOCHEMISTRY (Ed. Harris, J. R.) Plenum, N.Y., 1989, pp. 39-68), Morein et al., “The ISCOM—an approach to subunit vaccines” in RECOMBINANT DNA VACCINES: RATIONALE AND STRATEGY (Ed. Isaacson, R. E.) Marcel Dekker, N.Y., 1992, pp. 369-386, and Morein et al., Clin Immunotherapeutics 3:461-75 (1995)). Recently, monophosphoryl lipid A, ISCOMs with Quil-A, and Syntex adjuvant formulations (SAFs) containing the threonyl derivative or muramyl dipeptide also have been under consideration for use in human vaccines. Numerous types of adjuvants that can be suitable for co-administration or serial administration with one or more polypeptides of the invention are known in the art. Examples of such adjuvants are described in, e.g., Vogel et al., A COMPENDIUM OF VACCINE ADJUVANTS AND EXCIPIENTS (2d Ed) (“http” colon, slash, slash, “www” dot “niaid” dot “nih” dot “gov” slash “aidsvaccine” slash “pdf” slash “compendium” dot “pdf”—accessed Jan. 24, 2002), Bennet et al., J. Immun Meth 153:31-40 (1992), Bessler et al., Res Immunol, 143(5): 519-25 (1992), Woodard, Lab Animal Sci 39(3):222-5 (1989), Vogel, AIDS Res and Human Retroviruses 11(10):1277-1278 (1995), Leenaars et al., Vet Immunol Immunopath 40:225-241 (1995), Linblak et al., Scandinavian J Lab Animal Sci 14:1-13 (1987), Buiting et al. Res Immunol 143(5):541-548 (1992), Gupta and Siber, Vaccine (14):1263-1276 (1996), and U.S. Pat. Nos. 6,340,464, 6,328,965, 6,299,884, 6,083,505, 6,080,725, 6,060,068, 5,961,970, 5,814,321, 5,747,024, 5,690,942, 5,679,356, 5,650,155, 5,585,099, 4,395,394, and 4,370,265.

Administration of a nucleic acid of the invention also is typically and preferably followed by boosting (at least a prime, preferably at least a prime and secondary boost). A “prime” is typically the first immunization. An initial nucleic acid administration can be followed by a repeat administration of the nucleic acid at least about 7 days, more typically and preferably about 14-35 days, or about 2, 4, 6, 9, or 12 months, after the initial nucleic acid administration. The amount administered in the repeat administration is typically substantially similar (if not identical) to the dose of the nucleic acid initially administered, (e.g., about 50 μg to about 15 or 20 mg, or 1 mg to about 10 mg, or 2-5 mg in a 1-2 mL volume injectable and pharmaceutically acceptable solution).

Alternatively, the initial administration of the nucleic acid can be followed by a prime boost of an immunogenic amount of polypeptide at such a time. Preferably, in such aspects, a secondary boost also is preferably performed with nucleic acid and/or polypeptide, in an amount similar to that used in the primary boost and/or the initial nucleic acid administration, at about 2-9, 3-6 months, 9-18 months, or about 12 or 24 months after the initial polypeptide administration. Any number of boosting administrations of nucleic acid and/or polypeptide can be performed.

The polypeptide, nucleic acid, and/or vector of the invention can be used to promote any suitable immune response to at least one dengue virus of one or more serotypes in any suitable context. For example, at least one recombinant polypeptide, nucleic acid, and/or vector can be administered as a prophylactic in an immunogenic or antigenic amount to a mammal (preferably, a human) that has not been infected with a dengue virus of a particular serotype. Favorably, the administration of the at least one recombinant polypeptide, nucleic acid, and/or vector induces a protective immune response against challenge with at least one dengue virus of at least one serotype, and, as such, can be considered a “vaccine” against dengue virus infection by said at least one dengue virus of the at least one serotype. Preferably, the administration of the at least one recombinant polypeptide, nucleic acid, and/or vector induces a protective immune response against challenge with at least one dengue virus of each at least two serotypes, at least three serotypes, and preferably at least four serotypes and, as such, can be considered a “vaccine” against dengue virus infection by viruses of the at least two, at least three or all four dengue virus serotypes, respectively.

In an advantage aspect, at least one polypeptide, polynucleotide, and/or vector of the invention is administered to a mammal, preferably a human, that has been previously infected with at one dengue virus of at least one particular serotype, such that a protective immune response (such as, e.g., a neutralizing antibody immune response) to one or more dengue viruses of other serotypes is induced in the mammal (human), most preferably without the occurrence of ADE upon administration of the at least one polynucleotide, polypeptide, and/or vector, as well as upon challenge with one or more dengue viruses of serotypes other than the serotype of the dengue virus with the mammal was previously infected. The at least one polypeptide, polynucleotide, and/or vector also can be administered to a mammal (e.g., a human) actively infected with at least one dengue virus of one serotype, to aid in the production of an immune response against further dengue virus infections. Most preferably, the polypeptide, nucleic acid, and/or vector is administered in an amount sufficient to induce a protective immune response in a human at risk for dengue virus infection (or at specific risk for DF and/or DHF), or that is or has been previously infected with a dengue virus of at least one serotype, to avoid DHF upon secondary infection, preferably without the occurrence of ADE.

The polynucleotides and vectors of the invention can be delivered by ex vivo delivery of cells, tissues, or organs. As such, the invention provides a method of promoting an immune response to a dengue virus comprising inserting at least one nucleic acid of the invention and/or a vector of the invention into a population of cells and implanting the cells in a mammal. Ex vivo administration strategies are known in the art (see, e.g., U.S. Pat. No. 5,399,346 and Crystal et al., Cancer Chemother. Pharmacol., 43(Suppl.), S90-S99 (1999)). Cells or tissues can be injected by a needle or gene gun or implanted into a mammal ex vivo. Briefly, in ex vivo techniques, a culture of cell (e.g., organ cells, cells of the skin, muscle, etc.) or target tissue is provided, or preferably removed from the host, contacted with the vector or polynucleotide composition, and then reimplanted into the host (e.g., using techniques described in or similar to those provided in). Ex vivo administration of the nucleic acid can be used to avoid undesired integration of the nucleic acid and to provide targeted delivery of the nucleic acid or vector. Such techniques can be performed with cultured tissues or synthetically generated tissue. Alternatively, cells can be provided or removed from the host, contacted (e.g., incubated with) an immunogenic amount of a polypeptide of the invention that is effective in prophylactically inducing an immune response to a dengue virus (preferably a protective immune response, such as a protective neutralizing antibody response) when the cells are implanted or reimplanted to the host. The contacted cells are then delivered or returned to the subject to the site from which they were obtained or to another site (e.g., including those defined above) of interest in the subject to be treated. If desired, the contacted cells may be grafted onto a tissue, organ, or system site (including all described above) of interest in the subject using standard and well-known grafting techniques or, e.g., delivered to the blood or lymph system using standard delivery or transfusion techniques. Such techniques can be performed with any suitable type of cells. For example, in one aspect, activated T cells can be provided by obtaining T cells from a subject (e.g., mammal, such as a human) and administering to the T cells a sufficient amount of one or more polypeptides of the invention to activate effectively the T cells (or administering a sufficient amount of one or more nucleic acids of the invention with a promoter such that uptake of the nucleic acid into one or more such T cells occurs and sufficient expression of the nucleic acid results to produce an amount of a polypeptide effective to activate said T cells). The activated T cells are then returned to the subject. T cells can be obtained or isolated from the subject by a variety of methods known in the art, including, e.g., by deriving T cells from peripheral blood of the subject or obtaining T cells directly from a tumor of the subject. Other preferred cells for ex vivo methods include explanted lymphocytes, particularly B cells, antigen presenting cells (APCs), such as dendritic cells, and more particularly Langerhans cells, monocytes, macrophages, bone marrow aspirates, or universal donor stem cells. A preferred aspect of ex vivo administration of a polynucleotide or polynucleotide vector can be the assurance that the polynucleotide has not integrated into the genome of the cells before delivery or readministration of the cells to a host. If desired, cells can be selected for those where uptake of the polynucleotide or vector, without integration, has occurred, using standard techniques known in the art.

The invention includes a method of inducing an immune response in a subject to at least one dengue virus of at least one serotype comprising: (a) providing a population of B cells, dendritic cells, or both; (b) transforming the cells with at least one nucleic acid of the invention such that the nucleic acid does not integrate into a genome of any of the cells, and (c) delivering an effective amount of the cells to the subject, wherein the cells express the at least one nucleic acid after delivery and induce an immune response to the at least one dengue virus in the subject. In some such methods, prior to transforming the cells with the nucleic acid, the cells are obtained from a subject, and after transformation with the at least one nucleic acid, the cells are delivered to the same subject. Some such methods further comprise delivering at least one of the following to a subject: 1) polypeptide comprising GM-CSF or an interferon (IFN), 2) a nucleic acid encoding GM-CSF or an interferon, and 3) a nucleic acid encoding GM-CSF and an interferon.

In another aspect, the invention provides a method of inducing an immune response by administering a population of recombinant VLPs or attenuated viruses of the invention, formed by populations of polypeptides comprising, e.g., recombinant C15/full prM/full E polypeptides or full prM/full E polypeptides of the invention. The administration of VLPs or attenuated viruses is carried out using techniques similar to those used for the administration of polypeptides and viral vectors, as described above (e.g., VLPs are preferably administered in a pharmaceutically acceptable injectable solution into or through the skin, intramuscularly, or intraperitoneally). The skin and muscle are generally preferred targets for administration of the polypeptides, vectors, and polynucleotides of the invention, by any suitable technique. Thus, the delivery of the polypeptide, polynucleotide, or vector of the invention into or through the skin of a subject, e.g., mammal (preferably a human), is a feature of the invention. Such administration can be accomplished by transdermal devices, or, more typically, biolistic delivery of the polypeptide, polynucleotide, and/or vector to, into, or through the skin of the mammal, or into exposed muscle of the subject. Transdermal devices provided by the invention, described elsewhere herein, for example, can be applied to the skin of a host for a suitable period such that sufficient transfer of a polynucleotide and/or vector to the mammal occurs, thereby promoting an immune response to at least one dengue virus. Muscular administration is more typically facilitated by injection of a liquid solution comprising a polypeptide, polynucleotide, or vector of the invention. Particular cells that can be advantageously targeted include dendritic cells, other APCs, B cells, monocytes, T cells (including T helper cells), and cells associated with such immune system cells (e.g., keratinocytes or other skin cells associated with Langerhans cells). Targeting of vectors and nucleic acids of the invention is described elsewhere herein. Such targeted administration can be performed with nucleic acids or vectors comprising nucleic acids operably linked to cell and/or tissue-specific promoters, examples of which are known in the art.

The polynucleotide of the invention can be administered by any suitable delivery system, such that expression of a recombinant of the polypeptide occurs in the host resulting in an immune response to a dengue virus. For example, an effective amount of a population of bacterial cells comprising a nucleic acid of the invention can be administered to a subject, resulting in expression of a recombinant polypeptide of the invention, and induction of an immune response to dengue viruses in the subject, e.g., mammal. Bacterial cells developed for mammalian gene delivery are known in the art. In another aspect, administration of a polynucleotide or vector (preferably a polynucleotide vector) of the invention is facilitated by application of electroporation to an effective number of cells or an effective tissue target, such that the nucleic acid and/or vector is taken up by the cells, and expressed therein, resulting in production of a recombinant polypeptide of the invention therein and subsequent induction of an immune response to dengue viruses in the mammal.

In some aspects, the nucleic acid, polypeptide, and/or vector of the invention is desirably co-administered with an additional nucleic acid or additional nucleic vector comprising an additional nucleic acid that increases the immune response to a dengue virus upon administration of the nucleic acid, polypeptide, and/or vector of the invention. Preferably, such a second nucleic acid comprises a sequence encoding a GM-CSF, an interferon (e.g., IFN-gamma), or both, examples of which are discussed elsewhere herein. Alternatively, the second nucleic acid can comprise immunostimulatory (CpG) sequences, as described elsewhere herein. GM-CSF, IFN-gamma, or other polypeptide adjuvants also can be co-administered with the polypeptide, polynucleotide, and/or vector. Co-administration in this respect encompasses administration before, simultaneously with, or after, the administration of the polynucleotide, polypeptide, and/or vector of the invention, at any suitable time resulting in an enhancement of an immune response to a dengue virus.

Dengue viruses are transmitted through Aedes mosquito bites, posing a significant threat to people living in or visiting tropical areas. Dengue virus infections are clinically manifested for most of the cases by dengue fever (DF), which is a self-limited fibril illness. The severe to fatal dengue hemorrhagic fever/dengue shock syndrome (DHF/DSS), which is associated with a mortality rate between 1-5%, is often linked to secondary infections.

Multiple dengue serotypes can be prevalent in one local area, and it is therefore important to test a patient's serum samples simultaneously against such multiple serotypes, preferably those prevalent in the area in which the patient lives. In areas where all four dengue serotypes may be prevalent, it is important to test a patient's serum samples simultaneously against all 4 serotypes. For treatment, it is not always necessary to determine the specific serotype that has infected a patient; however, it is important to distinguish the specific infecting virus(es) from other viruses causing hemorrhagic fever. Cultivating or growing DEN viruses in cultures is very tedious and the quality of the virus samples obtained often varies extensively, due to the growth abilities and stability of the different viruses. This makes it difficult to produce ELISA plates with consistent antigen quality for multiple (e.g., 2, 3, especially 4) serotypes in each well. Additionally, due to the difficulty of obtaining inactivated viruses, such assay plates are very expensive and for large scale clinical testing in poor countries not affordable.

The invention provides new diagnostic assays using at least one recombinant chimeric dengue virus antigen polypeptide that displays one or more conformational epitopes of one or more of the four dengue virus serotypes and/or recognizes one or more antibodies against at least one dengue virus of each of at least one, two, three or four serotypes. Such recombinant polypeptides are recognized by type-specific antisera. Such recombinant dengue virus antigens of the invention are useful as diagnostic tools to capture antibodies against one, two, three, and preferably all 4 dengue serotypes. In a particular aspect, the invention provides diagnostic assays using at least one recombinant or synthetic polypeptide of the invention that displays one or more conformational epitopes of each of two, three, or four dengue virus serotypes (DEN-1, DEN-2, DEN-3, and/or DEN-4) and/or recognizes one or more antibodies against at least one dengue virus of each of at least, two, three or four serotypes (e.g., multivalent antigens).

In a preferred aspect, the invention provides diagnostic assays using at least one recombinant or synthetic polypeptide of the invention that displays one or more conformational epitopes of each of the four 4 DEN virus serotypes and/or recognizes antibodies against at least one dengue virus of each of the four serotypes. As shown below in detail in the Examples below, such tetravalent antigenic polypeptides induced an antibody response in vivo in subjects and are useful as vaccine candidates.

For example, four recombinant PRM15/tE polypeptides (2/7E, 5/21, 2G11, and 6E12) were selected to test as diagnostic antigens (see Example 19 below for details). Alternatively, “full-length” clones (2/7E-D1, 5/217E-D1, 2G11E-D4, and 6E12-D4) can be used. It was previously shown for TBE, another flavivirus, that expression of the viral prM and 100% of the E gene can lead to viral-like particle (VLP) formation, physically and antigenically resembling the virus particles. It is believed recombinant C15/full prM/full E (e.g., 2/7E-D1, 5/21-D1, 2G11-D4, and 6E12-D4) or full prM/full E clones of the invention form VLPs. Expressed in human 293 cells, VLP-like antigens are secreted into the medium, which allows for easy antigen isolation.

The Western blot of FIG. 13A illustrates recognition of the four tetravalent clones (2/7E, 5/21, 2G11, and 6E12) by type-specific antisera and the expression and secretion of the antigens from human 293 cells. For diagnostics, the antigens can be used for a dot blot assay, ELISA, or dipstick EIA. A dot blot assay is an advantageous assay in terms of manufacturing, storage and handling. The use of the recombinant multivalent antigens as a diagnostic tool was demonstrated by a dot blot assay as shown in Example 19. Each of these recombinant tetravalent polypeptides was well-secreted and recognized by all 4 type-specific anti-DEN antisera and thus can be used to detect serum antibodies against any of the 4 DEN serotypes in a biological human sample obtained from an animal, including a human. Such diagnostic assays advantageously allow for the testing of a subject's serum sample simultaneously for antibodies against all four serotypes.

The invention further provides methods of diagnosing or screening a composition, preferably a biological sample obtained from a subject (e.g., vertebrate, such as a mammal), such as blood or serum, for the presence or absence of one or more anti-flavivirus antibodies of one or more virus serotypes, including one or more antibodies against one or more flaviviruses or variants thereof that are closely related to one or more dengue viruses, and especially anti-dengue virus antibodies (including, e.g., antibodies against one or more dengue viruses or variants thereof). In one such aspect, the invention provides a method of diagnosing or screening a sample for the presence of one or more types of antibodies (or detecting in the composition the presence of one or more antibodies) that bind to at least one dengue virus of at least one serotype. The method comprises contacting a sample with a polypeptide of the invention under conditions such that if the sample comprises antibodies that bind to at least one dengue virus of at least one serotype at least one anti-dengue virus antibody binds to the polypeptide to form a mixed composition, contacting the mixed composition with at least one affinity-molecule that binds to an anti-dengue virus antibody, removing unbound affinity-molecule from the mixed composition, and detecting the presence or absence of affinity molecules in the composition, wherein the presence of an affinity molecule is indicative of the presence of antibodies in the sample that bind to the at least one dengue virus of the at least one serotype.

In another aspect, the invention provides a method of diagnosing, detecting in, identifying in, selecting from, or screening a sample for the presence of antibodies that bind or specifically bind to (or react with) at least one dengue virus of at least one serotype. In one aspect, such method comprises contacting a sample with a polypeptide of the invention under conditions such that if the sample comprises antibodies that bind to dengue virus at least one anti-dengue virus antibody binds to the polypeptide to form an antibody-polypeptide complex and detecting the presence or absence of an antibody-polypeptide complex, wherein the presence of an antibody-polypeptide complex is indicative of the presence of antibodies that bind to a dengue virus. In some such methods, the method comprises screening, detecting in, selecting from, or diagnosing a sample for the presence of antibodies that specifically bind to or specifically associate with a dengue virus of one or more serotypes. Preferably, the sample is a biological sample, preferably obtained from a mammal, which typically is suspected of and/or at risk for infection with one or more dengue viruses. Any suitable biological sample (i.e., that includes a sufficient quantity of antibodies for analysis, if present) can be used. Typically and preferably, serum from a mammal, typically a human, is obtained and used for such analysis. Alternatively, tissues where antibody concentrations are expected to be high, such as lymphoid tissues, can be analyzed.

The invention also includes an immunoassay for at least one dengue virus antibody which comprises the use of a polypeptide of the invention as a test sample. The above-described methods can further be modified to form any suitable type of immunoassay, examples of which are described above. Preferred immunoassays in this respect include dot blot assays, ELISA assays (e.g., competitive ELISA assays), and dipstick EIAs. In preparation of such assays, the polypeptide is bound (or associated with) a solid or semisolid matrix, to promote antigen-antibody complex formation. The detection of such antibody-antigen complexes is typically facilitated with a reagent suitable for visualization, such as dyes used in ELISA and FACS assays described elsewhere herein. Compositions comprising such elements are provided by the invention. For example, the invention provides a composition comprising at least one polypeptide of the invention bound to a solid matrix, and optionally including a reagent for visualizing an antibody bound to the polypeptide.

The invention also includes a kit for performing such an immunoassay comprising a composition of a polypeptide of the invention, bound to a solid matrix, in combination with a reagent suitable for visualization of antigen-antibody complexes after incubation of the matrix with a biological sample suspected of comprising anti-dengue virus antibodies.

A suitable substrate for performing an immunoassay to detect one or more anti-dengue virus antibodies in a sample composition is advantageously provided by obtaining cell free medium, aspirated from a culture of cells transformed with a polynucleotide of the invention (including a nucleic acid vector), or infected with a viral vector of the invention, which cells at least partially secrete a polypeptide of the invention into the cell medium such that the aspirated medium (supernatant) comprises a sufficient amount of polypeptide for use in the immunoassay. Remarkably, as little as about 10 μl of such a cell supernatant can be used as a substrate for a sensitive immunoassay, which is able to detect the presence of antibodies to dengue viruses of multiple serotypes, and, most preferably, to all four virus serotypes, preferably in a sample of serum obtained from a mammalian host (e.g., a human). The inventors contemplate the use of larger amounts of such supernatant (e.g., about 20 μl, about 50 μl, about 100 μl, or more), as well as the use of cell lysates of cells transfected with nucleic acids (or nucleic acid vectors) of the invention, as well as of cells infected with viral vectors of the invention. The supernatant can be associated with a matrix for performing EIAs (e.g., with an ELISA plate for ELISA assay or with a suitable membrane for dot blot assay) or can be directly used in an immunoprecipitation or other direct detection immunohistochemical technique. Similar techniques that can be modified with reference to the polypeptides of the invention are described in, e.g., U.S. Pat. No. 5,939,254 and other references cited herein.

The invention also includes a method of identifying the presence of antibodies to a flavivirus in a biological sample obtained from a subject, such as a mammal, comprising contacting at least one polypeptide of the invention (or composition comprising at least one such polypeptide and a carrier or a solid matrix) with a biological sample obtained from the subject under conditions such that an antibody capable of binding to a flavivirus in the biological sample binds to the polypeptide and forms an antibody-polypeptide complex; and detecting the presence of the antibody-polypeptide complex in the biological sample, thereby indicating the presence of antibodies in the biological sample (e.g., blood or serum).

Pools or libraries of two or more polypeptides of the invention also can be used in diagnosis techniques. Alternatively, a polypeptide of the invention can be added to a pool of other molecules (e.g., a pool of polypeptides, such as a collection of viral antigens). Thus, a library comprising two or more polypeptides of the invention is a feature of the invention. Another feature of the invention is a library of polypeptides of the invention (e.g., a collection of fragments of polypeptides of the invention or a collection of substantially identical polypeptides of the invention). The polypeptide(s) of the invention can be used in such libraries for diagnostic techniques (e.g., multiple diagnostic techniques for viral infection and/or other disease diagnosis). For example, a library of pathogenic antigens from pathogens associated with fever (or other disease states), comprising at least one polypeptide of the invention, can be used to diagnose infection of a mammal, preferably a human, by reaction of a biological sample obtained from the mammal with such a library in a manner that a detectable biological reaction between the sample and at least one component of the library will occur, thereby indicating what type of infection the mammal suffers from. The incorporation of one or more polypeptides of the invention in diagnostic chips (“protein chips”) for such diagnostic techniques is a feature of the invention.

In another respect, the invention provides a polypeptide obtained by recursive sequence recombination (e.g., DNA shuffling and appropriate screening/selection methods) performed with a nucleic acid sequence of the invention (typically with multiple nucleic acid sequences of the invention and/or multiple wild-type flaviviral, preferably dengue virus, antigen-encoding sequences). For example, the invention provides a polypeptide obtained by a method of recursive sequence recombination that comprises recombining at least a first nucleic acid comprising a sequence selected from the group of SEQ ID NOS:169-174 and a second nucleic acid selected from the group of SEQ ID NOS:169-174 and 215-218, to produce a library of recombinant or synthetic nucleic acids, and screening the resulting library of recombinant or synthetic nucleic acids to identify at least one optimized nucleic acid that encodes a recombinant polypeptide that induces an immune response to at least a portion of dengue viruses of at least one virus serotype in a subject about equal to or greater than the immune response induced by a polypeptide encoded by the first nucleic acid, a polypeptide encoded by the second nucleic acid, or both. Typically, multiple nucleic acids selected from the first group and from the second group are used to generate or produce the library of nucleic acids.

In one embodiment, the invention provides a recombinant or synthetic polypeptide obtained by a method comprising: (a) recombining at least a first nucleic acid comprising a polynucleotide sequence selected from the group of SEQ ID NOS:211-214 and at least a second nucleic acid, wherein the at least first and second nucleic acids differ from each other in two or more nucleotides, to produce a library of recombinant or synthetic nucleic acids; (b) selecting from or screening the library of recombinant or synthetic nucleic acids to identify at least one recombinant or synthetic nucleic acid that encodes at least one recombinant or synthetic polypeptide that induces an immune response in a subject to at least one dengue virus of at least one serotype that is about equal to or greater than the immune response induced in the subject against said at least one dengue virus of said at least one serotype by the polypeptide encoded by the at least first nucleic acid or the at least second nucleic acid or both; and (c) expressing the at least one recombinant or synthetic nucleic acid to obtain the recombinant or synthetic polypeptide.

In another embodiment, the invention provides a recombinant or synthetic polypeptide obtained by a method comprising: (a) recombining at least a first nucleic acid comprising a polynucleotide sequence selected from the group of SEQ ID NOS:211-214 and at least a second nucleic acid, wherein the at least first and second nucleic acids differ from each other in two or more nucleotides, to produce a library of recombinant or synthetic nucleic acids; (b) selecting or screening the library of recombinant or synthetic nucleic acids to identify at least one recombinant or synthetic nucleic acid that encodes a recombinant or synthetic polypeptide that induces an immune response in a subject to at least one dengue virus of at least one serotype that is about equal to or greater than the immune response induced in the subject against said at least one dengue virus of said at least one serotype by a polypeptide encoded by the at least first nucleic acid and the at least second nucleic acid; (c) recombining the at least one recombinant or synthetic nucleic acid with at least a third nucleic acid comprising a sequence selected from the group of SEQ ID NOS:211-214, to produce a second library of recombinant or synthetic nucleic acid; (d) selecting from or screening the second library of recombinant or synthetic nucleic acids to identify at least one further recombinant or synthetic nucleic acid that encodes at least one further recombinant or synthetic polypeptide that induces an immune response to at least one dengue of at least one serotype in a subject that is about equal to or greater than the immune response induced in the subject against said at least one dengue virus of said at least one serotype by the polypeptide encoded by the at least first nucleic acid, the at least second nucleic acid, or the at least third nucleic acid, or any combination thereof; and (e) expressing the at least one further recombinant or synthetic to obtain the at least one further recombinant or synthetic polypeptide. Some such recombinant or synthetic polypeptides of the two embodiments above induce an immune response against the polypeptide encoded by the at least first nucleic acid and against the polypeptide encoded by the at least second nucleic acid in a subject. Further, some such recombinant or synthetic polypeptides of the two embodiments above induce an immune response against at least one dengue virus of each of dengue-1, dengue-2, dengue-3, and dengue-4 virus serotypes in a subject that is about equal to or greater than the immune response induced by the polypeptide encoded by the at least first nucleic acid, by the polypeptide encoded by the at least second nucleic acid, by the polypeptide encoded by the at least third nucleic acid, or by any combination thereof.

In another embodiment, the invention provides a recombinant or synthetic polypeptide obtained by a method comprising: (a) recombining at least a first nucleic acid comprising a polynucleotide sequence selected from the group of SEQ ID NOS:215-218, and at least a second nucleic acid, wherein the at least first and second nucleic acids differ from each other in two or more nucleotides, to produce a library of recombinant or synthetic nucleic acids; (b) selecting from or screening the library of recombinant or synthetic nucleic acids to identify at least one recombinant or synthesized nucleic acid that encodes at least one recombinant or synthetic polypeptide that induces an immune response to at least one dengue virus of at least one virus serotype in a subject that is about equal to or greater than the immune response induced against said at least one dengue virus of said at least one serotype by the polypeptide encoded by the at least first nucleic acid or the at least the second nucleic acid or both; and (c) expressing the at least one recombinant or synthetic nucleic acid to obtain the at least recombinant or synthetic polypeptide.

More generally, the polynucleotides of the invention and fragments thereof can be used as substrates for any of a variety of recombination and recursive sequence recombination reactions, in addition to their use in standard cloning methods as set forth in, e.g., Ausubel, Berger, and Sambrook, e.g., to produce additional polynucleotides or fragments thereof that encode recombinant dengue virus antigens having desired properties. A variety of such reactions are known, including those developed by the inventors and their co-workers.

A variety of diversity generating protocols for generating and identifying molecules of the invention having one of more of the properties described herein are available and described in the art. These procedures can be used separately, and/or in combination to produce one or more variants of a nucleic acid or set of nucleic acids, as well variants of encoded proteins. Individually and collectively, these procedures provide robust, widely applicable ways of generating diversified nucleic acids and sets of nucleic acids (including, e.g., nucleic acid libraries) useful, e.g., for the engineering or rapid evolution of nucleic acids, proteins, pathways, cells and/or organisms with new and/or improved characteristics. While distinctions and classifications are made in the course of the ensuing discussion for clarity, it will be appreciated that the techniques are often not mutually exclusive. Indeed, the various methods can be used singly or in combination, in parallel or in series, to access diverse sequence variants.

The result of any of the diversity-generating procedures described herein can be the generation of one or more nucleic acids, which can be selected or screened for nucleic acids with or which confer desirable properties, or that encode proteins with or which confer desirable properties. Following diversification by one or more of the methods herein, or otherwise available to one of skill, any nucleic acids that are produced can be selected for a desired activity or property described herein, including, e.g., an ability to induce, promote, enhance, or modulate an immune response against at least one dengue virus of at least one serotype, T cell proliferation and/or activation, cytokine production (e.g., (e.g., IL-3 production and/or IFN-γ production), the production of antibodies that bind (react) with at least one flavivirus (e.g., dengue virus) of at least one serotype, preferably two, three or four serotypes in a subject, and/or the production of neutralizing antibodies against at least one flavivirus (such as a dengue virus) of at least one, and against at least one flavivirus of each of at least one, two, three or four serotypes in a subject. For example, the desired property may be the ability to induce, promote, modulate or enhance the production of neutralizing antibodies against at least one dengue virus of each of the dengue-1, dengue-2, dengue-3, and dengue-4 serotypes in a subject. This can include identifying any activity that can be detected, for example, in an automated or automatable format, by any of the assays in the art, such as the assays discussed herein and exemplified in the Examples in the Example section below. A variety of related (or even unrelated) properties can be evaluated, in serial or in parallel, at the discretion of the practitioner.

Descriptions of a variety of diversity generating procedures for generating modified nucleic acid sequences that encode polypeptides of the invention as described herein are found in the following publications and the references cited therein: Soong, N. et al. (2000) “Molecular breeding of viruses” Nat Genet 25(4):436-439; Stemmer, et al. (1999) “Molecular breeding of viruses for targeting and other clinical properties” Tumor Targeting 4:1-4; Ness et al. (1999) “DNA Shuffling of subgenomic sequences of subtilisin” Nature Biotechnology 17:893-896; Chang et al. (1999) “Evolution of a cytokine using DNA family shuffling” Nature Biotechnology 17:793-797; Minshull and Stemmer (1999) “Protein evolution by molecular breeding” Current Opinion in Chemical Biology 3:284-290; Christians et al. (1999) “Directed evolution of thymidine kinase for AZT phosphorylation using DNA family shuffling” Nature Biotechnology 17:259-264; Crameri et al. (1998) “DNA shuffling of a family of genes from diverse species accelerates directed evolution” Nature 391:288-291; Crameri et al. (1997) “Molecular evolution of an arsenate detoxification pathway by DNA shuffling,” Nature Biotechnology 15:436-438; Zhang et al. (1997) “Directed evolution of an effective fucosidase from a galactosidase by DNA shuffling and screening” Proc. Natl. Acad. Sci. USA 94:4504-4509; Patten et al. (1997) “Applications of DNA Shuffling to Pharmaceuticals and Vaccines” Current Opinion in Biotechnology 8:724-733; Crameri et al. (1996) “Construction and evolution of antibody-phage libraries by DNA shuffling” Nature Medicine 2:100-103; Crameri et al. (1996) “Improved green fluorescent protein by molecular evolution using DNA shuffling” Nature Biotechnology 14:315-319; Gates et al. (1996) “Affinity selective isolation of ligands from peptide libraries through display on a lac repressor ‘headpiece dimer’” Journal of Molecular Biology 255:373-386; Stemmer (1996) “Sexual PCR and Assembly PCR” In: The Encyclopedia of Molecular Biology. VCH Publishers, New York. pp. 447-457; Crameri and Stemmer (1995) “Combinatorial multiple cassette mutagenesis creates all the permutations of mutant and wildtype cassettes” BioTechniques 18:194-195; Stemmer et al., (1995) “Single-step assembly of a gene and entire plasmid form large numbers of oligodeoxy-ribonucleotides” Gene, 164:49-53; Stemmer (1995) “The Evolution of Molecular Computation” Science 270: 1510; Stemmer (1995) “Searching Sequence Space” Bio/Technology 13:549-553; Stemmer (1994) “Rapid evolution of a protein in vitro by DNA shuffling” Nature 370:389-391; and Stemmer (1994) “DNA shuffling by random fragmentation and reassembly: In vitro recombination for molecular evolution.” Proc. Natl. Acad. Sci. USA 91:10747-10751.

The term “shuffling” is used herein to indicate recombination between non-identical sequences, in some embodiments shuffling may include crossover via homologous recombination or via non-homologous recombination, such as via cre/lox and/or flp/frt systems. Shuffling can be carried out by employing a variety of different formats, including for example, in vitro and in vivo shuffling formats, in silico shuffling formats, shuffling formats that utilize either double-stranded or single-stranded templates, primer based shuffling formats, nucleic acid fragmentation-based shuffling formats, and oligonucleotide-mediated shuffling formats, all of which are based on recombination events between non-identical sequences and are described in more detail or referenced herein below, as well as other similar recombination-based formats.

Mutational methods of generating diversity include, for example, site-directed mutagenesis (Ling et al. (1997) “Approaches to DNA mutagenesis: an overview” Anal Biochem. 254(2): 157-178; Dale et al. (1996) “Oligonucleotide-directed random mutagenesis using the phosphorothioate method” Methods Mol. Biol. 57:369-374; Smith (1985) “In vitro mutagenesis” Ann. Rev. Genet. 19:423-462; Botstein & Shortle (1985) “Strategies and applications of in vitro mutagenesis” Science 229:1193-1201; Carter (1986) “Site-directed mutagenesis” Biochem. J. 237:1-7; and Kunkel (1987) “The efficiency of oligonucleotide directed mutagenesis” in Nucleic Acids & Molecular Biology (Eckstein, F. and Lilley, D. M. J. eds., Springer Verlag, Berlin)); mutagenesis using uracil containing templates (Kunkel (1985) “Rapid and efficient site-specific mutagenesis without phenotypic selection” Proc. Natl. Acad. Sci. USA 82:488-492; Kunkel et al. (1987) “Rapid and efficient site-specific mutagenesis without phenotypic selection” Methods in Enzymol. 154, 367-382; and Bass et al. (1988) “Mutant Trp repressors with new DNA-binding specificities” Science 242:240-245); oligonucleotide-directed mutagenesis (Methods in Enzymol. 100: 468-500 (1983); Methods in Enzymol. 154: 329-350 (1987); Zoller & Smith (1982) “Oligonucleotide-directed mutagenesis using M13-derived vectors: an efficient and general procedure for the production of point mutations in any DNA fragment” Nucleic Acids Res. 10:6487-6500; Zoller & Smith (1983) “Oligonucleotide-directed mutagenesis of DNA fragments cloned into M13 vectors” Methods in Enzymol. 100:468-500; and Zoller & Smith (1987) “Oligonucleotide-directed mutagenesis: a simple method using two oligonucleotide primers and a single-stranded DNA template” Methods in Enzymol. 154:329-350); phosphorothioate-modified DNA mutagenesis (Taylor et al. (1985) “The use of phosphorothioate-modified DNA in restriction enzyme reactions to prepare nicked DNA” Nucl. Acids Res. 13: 8749-8764; Taylor et al. (1985) “The rapid generation of oligonucleotide-directed mutations at high frequency using phosphorothioate-modified DNA” Nucl. Acids Res. 13: 8765-8787 (1985); Nakamaye & Eckstein (1986) “Inhibition of restriction endonuclease Nci I cleavage by phosphorothioate groups and its application to oligonucleotide-directed mutagenesis” Nucl. Acids Res. 14: 9679-9698; Sayers et al. (1988) “Y-T Exonucleases in phosphorothioate-based oligonucleotide-directed mutagenesis” Nucl. Acids Res. 16:791-802; and Sayers et al. (1988) “Strand specific cleavage of phosphorothioate-containing DNA by reaction with restriction endonucleases in the presence of ethidium bromide” Nucl. Acids Res. 16: 803-814); mutagenesis using gapped duplex DNA (Kramer et al. (1984) “The gapped duplex DNA approach to oligonucleotide-directed mutation construction” Nucl. Acids Res. 12: 9441-9456; Kramer & Fritz (1987) Methods in Enzymol. “Oligonucleotide-directed construction of mutations via gapped duplex DNA” 154:350-367; Kramer et al. (1988) “Improved enzymatic in vitro reactions in the gapped duplex DNA approach to oligonucleotide-directed construction of mutations” Nucl. Acids Res. 16: 7207; and Fritz et al. (1988) “Oligonucleotide-directed construction of mutations: a gapped duplex DNA procedure without enzymatic reactions in vitro” Nucl. Acids Res. 16: 6987-6999).

Additional suitable diversity-generating methods include point mismatch repair (Kramer et al. (1984) “Point Mismatch Repair” Cell 38:879-887), mutagenesis using repair-deficient host strains (Carter et al. (1985) “Improved oligonucleotide site-directed mutagenesis using M13 vectors” Nucl. Acids Res. 13: 4431-4443; and Carter (1987) “Improved oligonucleotide-directed mutagenesis using M13 vectors” Methods in Enzymol. 154: 382-403), deletion mutagenesis (Eghtedarzadeh & Henikoff (1986) “Use of oligonucleotides to generate large deletions” Nucl. Acids Res. 14: 5115), restriction-selection and restriction-purification (Wells et al. (1986) “Importance of hydrogen-bond formation in stabilizing the transition state of subtilisin” Phil. Trans. R. Soc. Lond. A 317: 415-423), mutagenesis by total gene synthesis (Nambiar et al. (1984) “Total synthesis and cloning of a gene coding for the ribonuclease S protein” Science 223: 1299-1301; Sakamar and Khorana (1988) “Total synthesis and expression of a gene for the a-subunit of bovine rod outer segment guanine nucleotide-binding protein (transducin)” Nucl. Acids Res. 14: 6361-6372; Wells et al. (1985) “Cassette mutagenesis: an efficient method for generation of multiple mutations at defined sites” Gene 34:315-323; and Grundstrom et al. (1985) “Oligonucleotide-directed mutagenesis by microscale ‘shot-gun’ gene synthesis” Nucl. Acids Res. 13: 3305-3316), double-strand break repair (Mandecki (1986) “Oligonucleotide-directed double-strand break repair in plasmids of Escherichia coli: a method for site-specific mutagenesis” Proc. Natl. Acad. Sci. USA, 83:7177-7181; and Arnold (1993) “Protein engineering for unusual environments” Current Opinion in Biotechnology 4:450-455). Additional details on many of the above methods can be found in Methods in Enzymology Volume 154, which also describes useful controls for trouble-shooting problems with various mutagenesis methods.

Additional site-mutagenesis techniques are described in, e.g., Edelman et al., DNA, 2, 183 (1983), Zoller et al., Nucl. Acids Res., 10, 6487-5400 (1982), and Veira et al., Meth. Enzymol., 153, 3 (1987)). Other useful mutagenesis techniques include alanine scanning, or random mutagenesis, such as iterated random point mutagenesis induced by error-prone PCR, chemical mutagen exposure, or polynucleotide expression in mutator cells (see, e.g., Bornscheueret et al., Biotechnol. Bioeng., 58, 554-59 (1998), Cadwell and Joyce, PCR Methods Appl., 3(6), S136-40 (1994), Kunkel et al., Methods Enzymol., 204, 125-39 (1991), Low et al., J. Mol. Biol., 260, 359-68 (1996), Taguchi et al., Appl. Environ. Microbiol., 64(2), 492-95 (1998), and Zhao et al., Nat. Biotech., 16, 258-61 (1998) for discussion of such techniques). Suitable primers for PCR-based site-directed mutagenesis or related techniques can be prepared by the methods described in, e.g., Crea et al., Proc. Natl. Acad. Sci. USA, 75, 5765 (1978).

Other useful techniques for promoting sequence diversity include PCR mutagenesis techniques (as described in, e.g., Kirsch et al., Nucl. Acids Res., 26(7), 1848-50 (1998), Seraphin et al., Nucl. Acids Res., 24(16), 3276-7 (1996), Caldwell et al., PCR Methods Appl., 2(1), 28-33 (1992), Rice et al., Proc. Natl. Acad. Sci. USA. 89(12), 5467-71 (1992) and U.S. Pat. No. 5,512,463), cassette mutagenesis techniques based on the methods described in Wells et al., Gene, 34, 315 (1985), phagemid display techniques (as described in, e.g., Soumillion et al., Appl. Biochem. Biotechnol., 47, 175-89 (1994), O'Neil et al., Curr. Opin. Struct. Biol., 5(4), 443-49 (1995), Dunn, Curr. Opin. Biotechnol., 7(5), 547-53 (1996), and Koivunen et al., J. Nucl. Med., 40(5), 883-88 (1999)), reverse translation evolution (as described in, e.g., U.S. Pat. No. 6,194,550), saturation mutagenesis described in, e.g., U.S. Pat. No. 6,171,820), PCR-based synthesis shuffling (as described in, e.g., U.S. Pat. No. 5,965,408) and recursive ensemble mutagenesis (REM) (as described in, e.g., Arkin and Yourvan, Proc. Natl. Acad. Sci. USA, 89, 7811-15 (1992), and Delgrave et al., Protein Eng., 6(3), 327-331 (1993)). Techniques for introducing diversity into a library of homologous sequences also are provided in U.S. Pat. Nos. 6,159,687 and 6,228,639.

Further details regarding various diversity generating methods can be found in the following U.S. patents, PCT publications and applications, and EPO publications: U.S. Pat. No. 5,605,793 to Stemmer (Feb. 25, 1997), “Methods for In Vitro Recombination;” U.S. Pat. No. 5,811,238 to Stemmer et al. (Sep. 22, 1998) “Methods for Generating Polynucleotides having Desired Characteristics by Iterative Selection and Recombination;” U.S. Pat. No. 5,830,721 to Stemmer et al. (Nov. 3, 1998), “DNA Mutagenesis by Random Fragmentation and Reassembly;” U.S. Pat. No. 5,834,252 to Stemmer, et al. (Nov. 10, 1998) “End-Complementary Polymerase Reaction;” U.S. Pat. No. 5,837,458 to Minshull, et al. (Nov. 17, 1998), “Methods and Compositions for Cellular and Metabolic Engineering;” WO 95/22625, Stemmer and Crameri, “Mutagenesis by Random Fragmentation and Reassembly;” WO 96/33207 by Stemmer and Lipschutz “End Complementary Polymerase Chain Reaction;” WO 97/20078 by Stemmer and Crameri “Methods for Generating Polynucleotides having Desired Characteristics by Iterative Selection and Recombination;” WO 97/35966 by Minshull and Stemmer, “Methods and Compositions for Cellular and Metabolic Engineering;” WO 99/41402 by Punnonen et al. “Targeting of Genetic Vaccine Vectors;” WO 99/41383 by Punnonen et al. “Antigen Library Immunization;” WO 99/41369 by Punnonen et al. “Genetic Vaccine Vector Engineering;” WO 99/41368 by Punnonen et al. “Optimization of Immunomodulatory Properties of Genetic Vaccines;” EP 752008 by Stemmer and Crameri, “DNA Mutagenesis by Random Fragmentation and Reassembly;” EP 0932670 by Stemmer “Evolving Cellular DNA Uptake by Recursive Sequence Recombination;” WO 99/23107 by Stemmer et al., “Modification of Virus Tropism and Host Range by Viral Genome Shuffling;” WO 99/21979 by Apt et al., “Human Papillomavirus Vectors;” WO 98/31837 by del Cardayre et al. “Evolution of Whole Cells and Organisms by Recursive Sequence Recombination;” WO 98/27230 by Patten and Stemmer, “Methods and Compositions for Polypeptide Engineering;” WO 98/27230 by Stemmer et al., “Methods for Optimization of Gene Therapy by Recursive Sequence Shuffling and Selection,” WO 00/00632, “Methods for Generating Highly Diverse Libraries,” WO 00/09679, “Methods for Obtaining in Vitro Recombined Polynucleotide Sequence Banks and Resulting Sequences,” WO 98/42832 by Arnold et al., “Recombination of Polynucleotide Sequences Using Random or Defined Primers,” WO 99/29902 by Arnold et al., “Method for Creating Polynucleotide and Polypeptide Sequences,” WO 98/41653 by Vind, “An in Vitro Method for Construction of a DNA Library,” WO 98/41622 by Borchert et al., “Method for Constructing a Library Using DNA Shuffling,” and WO 98/42727 by Pati and Zarling, “Sequence Alterations using Homologous Recombination;” WO 00/18906 by Patten et al., “Shuffling of Codon-Altered Genes;” WO 00/04190 by del Cardayre et al. “Evolution of Whole Cells and Organisms by Recursive Recombination;” WO 00/42561 by Crameri et al., “Oligonucleotide Mediated Nucleic Acid Recombination;” WO 00/42559 by Selifonov and Stemmer “Methods of Populating Data Structures for Use in Evolutionary Simulations;” WO 00/42560 by Selifonov et al., “Methods for Making Character Strings, Polynucleotides & Polypeptides Having Desired Characteristics;” PCT/US00/26708 by Welch et al., “Use of Codon-Varied Oligonucleotide Synthesis for Synthetic Shuffling;” and PCT/US01/06775 “Single-Stranded Nucleic Acid Template-Mediated Recombination and Nucleic Acid Fragment Isolation” by Affholter.

Several different general classes of sequence modification methods, such as mutation, recombination, etc. are applicable to the present invention and set forth, e.g., in the references above and below. The following exemplify some of the different types of preferred formats for diversity generation in the context of the present invention, including, e.g., certain recombination based diversity generation formats.

Nucleic acids can be recombined in vitro by any of a variety of techniques discussed in the references above, including e.g., DNAse digestion of nucleic acids to be recombined followed by ligation and/or PCR reassembly of the nucleic acids. For example, sexual PCR mutagenesis can be used in which random (or pseudo random, or even non-random) fragmentation of the DNA molecule is followed by recombination, based on sequence similarity, between DNA molecules with different but related DNA sequences, in vitro, followed by fixation of the crossover by extension in a polymerase chain reaction. This process and many process variants is described in several of the references above, e.g., in Stemmer (1994) Proc. Natl. Acad. Sci. USA 91:10747-10751.

Similarly, nucleic acids can be recursively recombined in vivo, e.g., by allowing recombination to occur between nucleic acids in cells. Many such in vivo recombination formats are set forth in the references noted above. Such formats optionally provide direct recombination between nucleic acids of interest, or provide recombination between vectors, viruses, plasmids, etc., comprising the nucleic acids of interest, as well as other formats. Details regarding such procedures are found in the references noted above. Whole genome recombination methods can also be used in which whole genomes of cells or other organisms are recombined, optionally including spiking of the genomic recombination mixtures with desired library components (e.g., genes corresponding to the pathways of the present invention). These methods have many applications, including those in which the identity of a target gene is not known. Details on such methods are found, e.g., in WO 98/31837 by del Cardayre et al. “Evolution of Whole Cells and Organisms by Recursive Sequence Recombination;” and in, e.g., PCT/US99/15972 by del Cardayre et al., also entitled “Evolution of Whole Cells and Organisms by Recursive Sequence Recombination.”

Synthetic recombination methods can also be used in which oligonucleotides corresponding to targets of interest are synthesized and reassembled in PCR or ligation reactions which include oligonucleotides which correspond to more than one parental nucleic acid, thereby generating new recombined nucleic acids. Oligonucleotides can be made by standard nucleotide addition methods, or can be made, e.g., by tri-nucleotide synthetic approaches. Details regarding such approaches are found in the references noted above, including, e.g., WO 00/42561 by Crameri et al., “Oligonucleotide Mediated Nucleic Acid Recombination;” PCT/US00/26708 by Welch et al., “Use of Codon-Varied Oligonucleotide Synthesis for Synthetic Shuffling;” WO 00/42560 by Selifonov et al., “Methods for Making Character Strings, Polynucleotides and Polypeptides Having Desired Characteristics;” and WO 00/42559 by Selifonov and Stemmer “Methods of Populating Data Structures for Use in Evolutionary Simulations.”

In silico methods of recombination can be effected in which genetic algorithms are used in a computer to recombine sequence strings that correspond to homologous (or even non-homologous) nucleic acids. The resulting recombined sequence strings are optionally converted into nucleic acids by synthesis of nucleic acids that correspond to the recombined sequences, e.g., in concert with oligonucleotide synthesis/gene reassembly techniques. This approach can generate random, partially random or designed variants. Many details regarding in silico recombination, including the use of genetic algorithms, genetic operators and the like in computer systems, combined with generation of corresponding nucleic acids (and/or proteins), as well as combinations of designed nucleic acids and/or proteins (e.g., based on cross-over site selection) as well as designed, pseudo-random or random recombination methods are described in WO 00/42560 by Selifonov et al., “Methods for Making Character Strings, Polynucleotides and Polypeptides Having Desired Characteristics” and WO 00/42559 by Selifonov and Stemmer “Methods of Populating Data Structures for Use in Evolutionary Simulations.” Extensive details regarding in silico recombination methods are found in these applications. This methodology is generally applicable to the nucleic acid sequences and polypeptide sequences of the invention.

Many methods of accessing natural diversity, e.g., by hybridization of diverse nucleic acids or nucleic acid fragments to single-stranded templates, followed by polymerization and/or ligation to regenerate full-length sequences, optionally followed by degradation of the templates and recovery of the resulting modified nucleic acids can be similarly used. In one method employing a single-stranded template, the fragment population derived from the genomic library(ies) is annealed with partial, or, often approximately full length ssDNA or RNA corresponding to the opposite strand. Assembly of complex chimeric genes from this population is then mediated by nuclease-base removal of non-hybridizing fragment ends, polymerization to fill gaps between such fragments and subsequent single stranded ligation. The parental polynucleotide strand can be removed by digestion (e.g., if RNA or uracil-containing), magnetic separation under denaturing conditions (if labeled in a manner conducive to such separation) and other available separation/purification methods. Alternatively, the parental strand is optionally co-purified with the chimeric strands and removed during subsequent screening and processing steps. Additional details regarding this approach are found, e.g., in “Single-Stranded Nucleic Acid Template-Mediated Recombination and Nucleic Acid Fragment Isolation” by Affholter, PCT/US01/06775.

In another approach, single-stranded molecules are converted to double-stranded DNA (dsDNA) and the dsDNA molecules are bound to a solid support by ligand-mediated binding. After separation of unbound DNA, the selected DNA molecules are released from the support and introduced into a suitable host cell to generate a library enriched sequences, which hybridize to the probe. A library produced in this manner provides a desirable substrate for further diversification using any of the procedures described herein.

Any of the preceding general recombination formats can be practiced in a reiterative fashion (e.g., one or more cycles of mutation/recombination or other diversity generation methods, optionally followed by one or more selection methods) to generate a more diverse set of recombinant nucleic acids.

Mutagenesis employing polynucleotide chain termination methods have also been proposed (see e.g., U.S. Pat. No. 5,965,408, “Method of DNA reassembly by interrupting synthesis” to Short, and the references above), and can be applied to the present invention. In this approach, double stranded DNAs corresponding to one or more genes sharing regions of sequence similarity are combined and denatured, in the presence or absence of primers specific for the gene. The single stranded polynucleotides are then annealed and incubated in the presence of a polymerase and a chain terminating reagent (e.g., ultraviolet, gamma or X-ray irradiation; ethidium bromide or other intercalators; DNA binding proteins, such as single strand binding proteins, transcription activating factors, or histones; polycyclic aromatic hydrocarbons; trivalent chromium or a trivalent chromium salt; or abbreviated polymerization mediated by rapid thermocycling; and the like), resulting in the production of partial duplex molecules. The partial duplex molecules, e.g., comprising partially extended chains, are then denatured and re-annealed in subsequent rounds of replication or partial replication resulting in polynucleotides which share varying degrees of sequence similarity and which are diversified with respect to the starting population of DNA molecules. Optionally, the products, or partial pools of the products, can be amplified at one or more stages in the process. Polynucleotides produced by a chain termination method, such as described above, are suitable substrates for any other described recombination format.

Diversity also can be generated in nucleic acids or populations of nucleic acids using a recombination procedure known as “incremental truncation for the creation of hybrid enzymes” (“ITCHY”) described in Ostermeier et al. (1999) “A combinatorial approach to hybrid enzymes independent of DNA homology” Nature Biotech 17:1205. This approach can be used to generate an initial a library of variants, which can optionally serve as a substrate for one or more in vitro or in vivo recombination methods. See, also, Ostermeier et al. (1999) “Combinatorial Protein Engineering by Incremental Truncation,” Proc. Natl. Acad. Sci. USA, 96: 3562-67; Ostermeier et al. (1999), “Incremental Truncation as a Strategy in the Engineering of Novel Biocatalysts,” Biological and Medicinal Chemistry, 7: 2139-44.

Mutational methods which result in the alteration of individual nucleotides or groups of contiguous or non-contiguous nucleotides can be favorably employed to introduce nucleotide diversity. Many mutagenesis methods are found in the above-cited references; additional details regarding mutagenesis methods can be found in following, which can also be applied to the present invention. For example, error-prone PCR can be used to generate nucleic acid variants. Using this technique, PCR is performed under conditions where the copying fidelity of the DNA polymerase is low, such that a high rate of point mutations is obtained along the entire length of the PCR product. Examples of such techniques are found in the references above and, e.g., in Leung et al. (1989) Technique 1:11-15 and Caldwell et al. (1992) PCR Methods Applic. 2:28-33. Similarly, assembly PCR can be used, in a process which involves the assembly of a PCR product from a mixture of small DNA fragments. A large number of different PCR reactions can occur in parallel in the same reaction mixture, with the products of one reaction priming the products of another reaction.

Oligonucleotide directed mutagenesis can be used to introduce site-specific mutations in a nucleic acid sequence of interest. Examples of such techniques are found in the references above and, e.g., in Reidhaar-Olson et al. (1988) Science, 241:53-57. Cassette mutagenesis can be used in a process that replaces a small region of a double stranded DNA molecule with a synthetic oligonucleotide cassette that differs from the native sequence. The oligonucleotide can include, e.g., completely and/or partially randomized native sequence(s).

Recursive ensemble mutagenesis is a process in which an algorithm for protein mutagenesis is used to produce diverse populations of phenotypically related mutants, members of which differ in amino acid sequence. This method uses a feedback mechanism to monitor successive rounds of combinatorial cassette mutagenesis. Examples of this approach are found in Arkin & Youvan (1992) Proc. Natl. Acad. Sci. USA 89:7811-7815. Exponential ensemble mutagenesis can be used for generating combinatorial libraries with a high percentage of unique and functional mutants. Small groups of residues in a sequence of interest are randomized in parallel to identify, at each altered position, amino acids which lead to functional proteins. Examples of such procedures are in Delegrave & Youvan (1993) Biotechnology Research 11:1548-1552.

In vivo mutagenesis can be used to generate random mutations in any cloned DNA of interest by propagating the DNA, e.g., in a strain of E. coli that carries mutations in one or more of the DNA repair pathways. These “mutator” strains have a higher random mutation rate than that of a wild-type parent. Propagating the DNA in one of these strains will eventually generate random mutations within the DNA. Such procedures are described in the references noted above. Alternatively, in vivo recombination techniques can be used. For example, a multiplicity of monomeric polynucleotides sharing regions of partial sequence similarity can be transformed into a host species and recombined in vivo by the host cell. Subsequent rounds of cell division can be used to generate libraries, members of which, include a single, homogenous population, or pool of monomeric polynucleotides. Alternatively, the monomeric nucleic acid can be recovered by standard techniques, e.g., PCR and/or cloning, and recombined in any of the recombination formats, including recursive recombination formats, described above. Other techniques that can be used for in vivo recombination and sequence diversification are described in U.S. Pat. No. 5,756,316.

Methods for generating multispecies expression libraries have been described (in addition to the reference noted above, see, e.g., Peterson et al. (1998) U.S. Pat. No. 5,783,431 “METHODS FOR GENERATING AND SCREENING NOVEL METABOLIC PATHWAYS,” and Thompson, et al. (1998) U.S. Pat. No. 5,824,485 METHODS FOR GENERATING AND SCREENING NOVEL METABOLIC PATHWAYS) and their use to identify protein activities of interest has been proposed (In addition to the references noted above, see Short (1999) U.S. Pat. No. 5,958,672 “PROTEIN ACTIVITY SCREENING OF CLONES HAVING DNA FROM UNCULTIVATED MICROORGANISMS”). Multispecies expression libraries include, in general, libraries comprising cDNA or genomic sequences from a plurality of species or strains, operably linked to appropriate regulatory sequences, in an expression cassette. The cDNA and/or genomic sequences are optionally randomly ligated to further enhance diversity. The vector can be a shuttle vector suitable for transformation and expression in more than one species of host organism, e.g., bacterial species, eukaryotic cells. In some cases, the library is biased by preselecting sequences which encode a protein of interest, or which hybridize to a nucleic acid of interest. Any such libraries can be provided as substrates for any of the methods herein described.

The polynucleotide sequences of the present invention can be engineered by standard techniques to make additional modifications, such as, the insertion of new restriction sites, the alteration of glycosylation patterns, the alteration of pegylation patterns, modification of the sequence based on host cell codon preference, the introduction of recombinase sites, and the introduction of splice sites.

In some applications, it is desirable to preselect or prescreen libraries (e.g., an amplified library, a genomic library, a cDNA library, a normalized library, etc.) or other substrate nucleic acids prior to diversification, e.g., by recombination-based mutagenesis procedures, or to otherwise bias the substrates towards nucleic acids that encode functional products. Libraries can also be biased towards nucleic acids which have specified characteristics, e.g., hybridization to a selected nucleic acid probe.

“Non-Stochastic” methods of generating nucleic acids and polypeptides are alleged in Short “Non-Stochastic Generation of Genetic Vaccines and Enzymes” WO 00/46344. These methods, including proposed non-stochastic polynucleotide reassembly and site-saturation mutagenesis methods are applicable to the present invention as well. Random or semi-random mutagenesis using doped or degenerate oligonucleotides is also described in, e.g., Arkin and Youvan (1992) “Optimizing nucleotide mixtures to encode specific subsets of amino acids for semi-random mutagenesis” Biotechnology 10:297-300; Reidhaar-Olson et al. (1991) “Random mutagenesis of protein sequences using oligonucleotide cassettes” Methods Enzymol. 208:564-86; Lim and Sauer (1991) “The role of internal packing interactions in determining the structure and stability of a protein” J. Mol. Biol. 219:359-76; Breyer and Sauer (1989) “Mutational analysis of the fine specificity of binding of monoclonal antibody 51F to lambda repressor” J. Biol. Chem. 264:13355-60); and “Walk-Through Mutagenesis” (U.S. Pat. Nos. 5,830,650 and 5,798,208, and European Patent 0 527 809).

It will readily be appreciated that any of the above described techniques suitable for enriching a library prior to diversification can also be used to screen the products, or libraries of products, produced by the diversity generating methods.

Kits for mutagenesis, library construction and other diversity generation methods are also commercially available. For example, kits are available from, e.g., Stratagene (e.g., QuickChange™ site-directed mutagenesis kit; and Chameleon™ double-stranded, site-directed mutagenesis kit), Bio/Can Scientific, Bio-Rad (e.g., using the Kunkel method described above), Boehringer Mannheim Corp., Clonetech Laboratories, DNA Technologies, Epicentre Technologies (e.g., 5 prime 3 prime kit); Genpak Inc, Lemargo Inc, Life Technologies (Gibco BRL), New England Biolabs, Pharmacia Biotech, Promega Corp., Quantum Biotechnologies, Amersham International plc (e.g., using the Eckstein method above), and Anglian Biotechnology Ltd (e.g., using the Carter/Winter method above).

The above references provide many mutational formats, including recombination, recursive recombination, recursive mutation and combinations or recombination with other forms of mutagenesis, as well as many modifications of these formats. Regardless of the diversity generation format that is used, the nucleic acids of the invention can be recombined (with each other, or with related (or even unrelated) sequences) to produce a diverse set of recombinant nucleic acids, including, e.g., sets of homologous nucleic acids, as well as corresponding polypeptides.

A recombinant nucleic acid produced by recombining one or more polynucleotide sequences of the invention with one or more additional nucleic acids using any of the above-described formats alone or in combination forms a part of the invention. The one or more additional nucleic acids may include another polynucleotide of the invention; optionally, alternatively, or in addition, the one or more additional nucleic acid can include, e.g., a nucleic acid encoding a naturally-occurring dengue virus prM and/or E protein-encoding sequence, a prM and/or E sequence of another flavivirus, or, e.g., any other homologous or non-homologous nucleic acid or fragments thereof (certain recombination formats noted above, notably those performed synthetically or in silico, do not require homology for efficient recombination).

Desirably, the recombinant polypeptides obtained by the above-described recombination methods are functional chimeric polypeptides. For example, a polypeptide produced by one of the above-described methods often and desirably induces an immune response against a polypeptide encoded by the first nucleic acid as well as against a polypeptide encoded by the second nucleic acid in a subject. Because multiple, preferably at least three, and more preferably, at least four, or more, nucleic acids are used in the recursive sequence recombination techniques of the invention, a polypeptide obtained from a nucleic acid product of such recombination reactions typically comprises two or more peptide fragments or peptide portions unique to a polypeptide encoded by one of the parental sequences. Each such peptide fragment or peptide portion is an amino acid sequence of one or more contiguous amino acids, usually at least about 10, at least about 15, at least about 20, or at least about 30 or more amino acids in length. Such peptide fragments or peptide portions can be identified by sequence analysis techniques. Examples of peptides of the invention having such sequence diversity (or complex chimerism) are described in the Examples section below. The unique peptide fragments or peptide portions corresponding to any particular peptide encoded by a parental sequence are separated from each other by peptide fragments or peptide portions, respectively, corresponding to peptides encoded by other parental nucleic acids. The parental peptide fragments or portions can be any suitable size. Typically, a parental peptide fragment or portion comprises at least about 10, at least about 15, at least about 20, and at least about 30 or more amino acids in length. Multiple peptide fragments or peptide portions desirably include epitopes present in the peptides encoded by the parental nucleic acids. In this respect, a polypeptide expressed from a recombinant nucleotide sequence of the invention desirably comprises at least about one, two, or more T-cell epitopes and/or antigenic sequences also present in the polypeptide encoded by at least one parental nucleic acid, and, more preferably, by each parental nucleic acid. Of course, recombination also can result in novel coding sequences not present in any of the parental nucleic acid sequences, and, as such, produce novel epitopes not observed in any of the parental nucleic acid sequences.

Polynucleotides produced by the above-described recombination, mutagenesis, and standard nucleotide synthesis techniques can be screened for any suitable characteristic, such as the expression of a recombinant polypeptide having any of the desirable characteristics attendant the novel dengue virus antigens of the invention, which are discussed in detail elsewhere herein. Polypeptides produced by such techniques and having such characteristics are an important feature of the invention. For example, the invention provides a recombinant polypeptide encoded by a recombinant polynucleotide produced by recursive sequence recombination with any nucleic acid sequence of the invention that induces an immune response against one or more dengue viruses of one, two, three, or preferably four virus serotypes in a subject. Preferably, the immune response induced by administration or expression of such a polypeptide in the subject is about equal to or greater than the immune response induced by a polypeptide encoded by the first nucleic acid, the immune response induced by a polypeptide encoded by the second nucleic acid, or both.

A polypeptide of the invention can comprise any suitable number of non-dengue virus antigen peptide fragments. Correspondingly, a polynucleotide of the invention can comprise any suitable number of nucleic acid fragments encoding such non-dengue virus antigen peptide fragments. Such peptide fragments are typically fused to the C or N terminus of a polypeptide of the invention, as desired, to form a fusion protein. Such fusion proteins are an important feature of the invention. In general, the invention provides a polypeptide having any of the above-described characteristics of the invention, that further includes a heterologous fusion partner (non dengue-antigen encoding peptide fragment or peptide portion), that exhibits at least one biological property that is separately detectable from the rest of the polypeptide in a subject. Numerous types of heterologous fusion partners can be incorporated into the polypeptide in addition to the immunogenic amino acid sequence (and optional signal sequence) of the invention described above.

A particularly useful fusion partner is a peptide fragment or peptide portion that facilitates purification of the polypeptide (“polypeptide purification subsequence”). Several types of suitable polypeptide purification subsequences are known in the art. Examples of such fusion partners include a polyhistidine sequence, histidine-tryptophan modules that allow purification on immobilized metals, such as a hexa-histidine peptide, a sequence encoding such a tag is incorporated in the pQE vector available from QIAGEN, Inc. (Chatsworth, Calif.), a sequence which binds glutathione (e.g., glutathione-S-transferase (GST)), a hemagglutinin (HA) tag (corresponding to an epitope derived from the influenza hemagglutinin protein; Wilson, I. et al. (1984) Cell 37:767), maltose binding protein sequences, the FLAG epitope utilized in the FLAGS extension/affinity purification system (Immunex Corp, Seattle, Wash.)—commercially available FLAG epitopes also are available through Kodak (New Haven, Conn.), thioredoxin (TRX), avidin, and the like. The inclusion of a protease-cleavable polypeptide linker sequence between the purification domain and the immunogenic amino acid sequence or immunogenic amino acid sequence/signal sequence portion of the polypeptide is useful to facilitate purification of an immunogenic fragment of the fusion protein. Histidine residues facilitate purification on IMIAC (immobilized metal ion affinity chromatography, as described in Porath et al. (1992) Protein Expression and Purification 3:263-281) while the enterokinase cleavage site provides a method for separating the polypeptide from the fusion protein. pGEX vectors (Promega; Madison, Wis.) conveniently can be used to express foreign polypeptides as fusion proteins with glutathione S-transferase (GST). Additional examples of such sequences and the use thereof for protein purification are described in, e.g., International Patent Application WO 00/15823. After expression of the polypeptide and isolation thereof by such fusion partners or otherwise (as described above), protein refolding steps can be used, as desired, in completing configuration of the mature polypeptide.

A fusion protein of the invention also can include one or more additional peptide fragments or peptide portions which promote detection of the fusion protein. For example, a reporter peptide fragment or portion (e.g., green fluorescent protein (GFP), β-galactosidase, or a detectable domain thereof) can be incorporated in the fusion protein. Additional marker molecules that can be conjugated to the polypeptide of the invention include radionuclides, enzymes, fluorophores, small molecule ligands, and the like.

A polypeptide of the invention can further be modified by the inclusion of at least one modified amino acid. The inclusion of one or more modified amino acids may be advantageous in, for example, (a) increasing polypeptide serum half-life, (b) reducing polypeptide antigenicity, or (c) increasing polypeptide storage stability. Amino acid(s) are modified, for example, co-translationally or post-translationally during recombinant production (e.g., N-linked glycosylation at N-X-S/T motifs during expression in mammalian cells) or modified by synthetic means. Non-limiting examples of a modified amino acid include a glycosylated amino acid, a sulfated amino acid, a prenlyated (e.g., farnesylated, geranylgeranylated) amino acid, an acetylated amino acid, an acylated amino acid, a PEG-ylated amino acid, a biotinylated amino acid, a carboxylated amino acid, a phosphorylated amino acid, and the like. References adequate to guide one of skill in the modification of amino acids are replete throughout the literature. Example protocols are found in Walker (1998) PROTEIN PROTOCOLS ON CD-ROM Humana Press, Towata, N.J. Preferably, the modified amino acid is selected from a glycosylated amino acid, a PEGylated amino acid, a farnesylated amino acid, an acetylated amino acid, a biotinylated amino acid, an amino acid conjugated to a lipid moiety, and an amino acid conjugated to an organic derivatizing agent.

Another feature of the invention is a polypeptide comprising an immunogenic amino acid sequence as described above and further comprising a targeting sequence other than, or in addition to, a signal sequence. For example, the polypeptide can comprise a sequence that targets a receptor on a particular cell type (e.g., a monocyte, dendritic cell, or associated cell) to provide targeted delivery of the polypeptide to such cells and/or related tissues. Signal sequences are described above, and include membrane localization/anchor sequences (e.g., stop transfer sequences, GPI anchor sequences), and the like.

In another aspect, the polypeptide can comprise a fusion partner that promotes stability of the polypeptide, secretion of the polypeptide (other than by signal targeting), or both. For example, the polypeptide can comprise an immunoglobulin (Ig) domain, such as an IgG polypeptide comprising an Fc hinge, a CH2 domain, and a CH3 domain, that promotes stability and/or secretion of the polypeptide.

A fusion protein of the invention can further include additional immunogenic amino acid sequences. For example, the fusion protein can comprise an amino acid sequence that has substantial identity (e.g., at least about 75%, 80%, 85%, 86%, 87%, 88% or 89%, preferably at least about 90%, 91%, 92%, 93%, 94%, 95%, 96% 97%, 98%, 99%, 99.5% sequence identity) with a sequence fragment or portion of a flavivirus capsid protein, preferably a dengue virus capsid protein (e.g., DEN-2 or DEN-4), of at least about 20 amino acids in length. For example, the polypeptide can comprise a fusion protein of an immunogenic amino acid sequence of the invention and a yellow fever virus or adenovirus envelope, capsid, or other protein. Alternatively or additionally, a polypeptide of the invention (or polynucleotide encoding a polypeptide of the invention) can be administered with one or more dengue capsid proteins, or portions or fragments of such proteins, such as a portion or fragment of a dengue capsid comprising a T cell epitope (examples of such epitopes are known in the art (see, e.g., Gagnon et al. J Virol, 70(1):141-147 (1996)), or one or more polynucleotides encoding such polypeptides or peptides. The polypeptide also or alternatively can comprise, or be administered with one or more dengue virus nonstructural proteins, or one or more nucleic acids encoding such proteins or substantially identical peptides (e.g., having at least about 75%, 80%, 85%, 86%, 87%, 88% or 89%, preferably at least about 90%, 91%, 92%, 93%, or 94%, and more preferably at least about 95% (e.g., about 87-95%), 96% 97%, 98%, 99%, 99.5% sequence identity). For example, the addition of a non-structural protein (e.g., NS1, NS2A, NS2B, NS3, NS4A, NS4B, and/or NS5 proteins) may increase a T cell response, if it includes one or more additional T cell epitopes. The invention includes polypeptides that have at least about 80%, 85%, 90% or more sequence identity to a WT NS dengue virus protein and uses of such polypeptides as described herein. The co-delivery of polypeptides comprising nonstructural protein B cell and/or T cell epitopes (like those found in N1 (also referred to as NS-1 or NS1), N3 (NS-3 or NS3), or N5 (NS-5 or NS5)—see, e.g., Mathew et al. J Clin Invest, 98(7): 1684-1692 (1996), Okamoto et al. J Gen Virol 79:697-704 (1998), Green et al., Virology 234(2):383-386 (1997), and Garcia et al. Am J Trop Med Hyg 56(4):466-70 (1997)), or co-expression of sequences encoding at least one such polypeptide, can improve the level of the multivalent immune response induced by the delivery and/or expression of the recombinant polypeptides of the invention in subjects, such as mammals.

The fusion protein peptide fragments or peptide portions can be associated in any suitable manner. Typically and preferably, the first and second peptide fragments or portions are covalently associated (e.g., by means of a peptide or disulfide bond). The peptide fragments or portions can be directly fused (e.g., the C-terminus of the immunogenic amino acid sequence can be fused to the N-terminus of a purification sequence or heterologous immunogenic sequence). The fusion protein can include any suitable number of modified bonds, e.g., isosteres, within or between the peptide portions. Alternatively, the fusion protein can include a peptide linker between the peptide fragments or portions that includes one or more amino acid sequences not forming part of the biologically active peptide portions. Any suitable peptide linker can be used. The linker can be any suitable size. Typically, the linker is less than about 30 amino acid residues, preferably less than about 20 amino acid residues, and more preferably about 10 or less than 10 amino acid residues. Typically, the linker predominantly comprises or consists of neutral amino acid residues. Suitable linkers are generally described in, e.g., U.S. Pat. Nos. 5,990,275, 6,010,883, 6,197,946, and European Patent Application 0 035 384. If separation of peptide fragments or peptide portions is desirable a linker that facilitates separation can be used. An example of such a linker is described in U.S. Pat. No. 4,719,326. “Flexible” linkers, which are typically composed of combinations of glycine and/or serine residues, can be advantageous. Examples of such linkers are described in, e.g., McCafferty et al., Nature, 348, 552-554 (1990), Huston et al., Proc. Natl. Acad. Sci. USA, 85, 5879-5883 (1988), Glockshuber et al., Biochemistry, 29, 1362-1367 (1990), and Cheadle et al., Molecular Immunol., 29, 21-30 (1992), Huston et al., Proc. Natl. Acad. Sci. USA, 85, 5879-5883 (1988), Bird et al., Science, 242, 423-26 (1988), and U.S. Pat. Nos. 5,672,683, 6,165,476, and 6,132,992.

The use of a linker also can reduce undesired immune response to the fusion protein created by the fusion of the two peptide fragments or peptide portions, which can result in an unintended MHC I and/or MHC II epitope being present in the fusion protein. In addition to the use of a linker, identified undesirable epitope sequences or adjacent sequences can be PEGylated (e.g., by insertion of lysine residues to promote PEG attachment) to shield identified epitopes from exposure. Other techniques for reducing immunogenicity of the fusion protein of the invention can be used in association with the administration of the fusion protein include the techniques provided in U.S. Pat. No. 6,093,699.

A recombinant polypeptide of the invention also desirably does not comprise irrelevant epitopes (i.e., non-dengue relevant epitopes) or inter-epitope junctions. Such techniques also can be used to prevent presentation of irrelevant epitopes and epitope-junctions. Techniques for analyzing epitopes are further provided in the Examples section, which can be used to rationally design recombinant antigens without such irrelevant and/or undesired epitopes.

Fragments of polypeptides of the invention also can be useful in promoting an immune response to a dengue virus in a subject. The invention provides such fragments and methods of use thereof. For example, the invention provides a fragment of a polypeptide of the invention that is at least about 75 amino acids in length, and is not identical to a fragment of a wild-type envelope protein or wild-type prM/E protein of DEN-1, DEN-2, DEN-3, or DEN-4, wherein the fragment induces an immune response to at least one dengue virus of at least one serotype in a subject. Particular polypeptide fragments induce an immune response to one or more dengue viruses of multiple serotypes (e.g., two, three, or all four known serotypes), including a neutralizing antibody response to one or more dengue viruses of each of two, three or four serotypes, and most preferably induce a protective immune response against one or more dengue viruses of each of two, three or four serotypes when administered or expressed appropriately in a subject.

In addition to encoding and expressing recombinant immunogenic polypeptides of the invention, the nucleic acids also can be useful for sense and anti-sense suppression of expression (e.g., to control expression levels in tissues away from those in which expression of an administered nucleic acid or vector is desired). A variety of sense and anti-sense technologies are known in the art, see, e.g., Lichtenstein & Nellen (1997) ANTISENSE TECHNOLOGY: A PRACTICAL APPROACH IRL Press at Oxford University, Oxford, England, Agrawal (1996) ANTISENSE THERAPEUTICS Humana Press, NJ, and references cited therein.

The invention further provides nucleic acids that comprise a nucleic acid sequence that is the substantial complement (i.e., comprises a sequence that complements at least about 90%, preferably at least about 95%), and more preferably the complement, of any of the above-described nucleic acid sequences. Such complementary nucleic acid sequences are useful in probes, production of the nucleic acid sequences of the invention, and as antisense nucleic acids for hybridizing to nucleic acids of the invention. Short oligonucleotide sequences comprising sequences that complement the nucleic acid, e.g., of about 15, 20, 30, or 50 bases (preferably at least about 12 bases), which hybridize under highly stringent conditions to a nucleic acid of the invention are useful as probes (e.g., to determine the presence of a nucleic acid of the invention in a particular cell or tissue and/or to facilitate the purification of nucleic acids of the invention). Polynucleotides comprising complementary sequences also can be used as primers for amplification of the nucleic acids of the invention.

The invention further provides a fragment of a nucleic acid of the invention that comprises a sequence that encodes a unique subsequence in a polypeptide selected from the group of SEQ ID NOS:1-49 and 153-155 as compared to any of SEQ ID NOS:338-341. Also provided is a fragment of a nucleic acid of the invention that comprises a sequence that encodes a unique subsequence in a polypeptide selected from the group of SEQ ID NOS:65-116 as compared to any of SEQ ID NOS:149-152. Also provided is a fragment of a nucleic acid of the invention that comprises a sequence that encodes a unique subsequence in a polypeptide selected from the group of SEQ ID NOS:139-148, 236-253, 343, and 345 as compared to any of SEQ ID NOS:227-230.

The invention also provides a fragment of a nucleic acid of the invention that comprises a unique sequence of nucleotides of at least about 300, preferably at least about 400, more preferably at least about 600, desirably at least about 900, and more desirably at least about 1200 nucleotides from a sequence selected from any of SEQ ID NOS:285-330, as compared to a wild-type dengue tE-polypeptide encoding sequence selected from any of SEQ ID NOS:338-341 and similar known dengue virus truncated E polypeptide-encoding nucleotide sequences, including those available in GenBank, and more preferably as compared to any known wild-type flaviviral truncated E polypeptide-encoding nucleotide sequence. Such fragment induces an immune response to at least one dengue virus of at least one serotype in a subject when administered and is useful in, among other things, methods of inducing an immune response in the subject and cells thereof against such at least one dengue virus (e.g., neutralizing Ab response against at least one dengue virus serotype).

In another aspect, the invention provides a fragment of a nucleic acid of the invention that comprises a unique sequence of nucleotides of at least about 300, preferably at least about 400, more preferably at least about 600, desirably at least about 900, and more desirably at least about 1200 nucleotides from a sequence selected from any of SEQ ID NOS:156-200 and 235, as compared to a wild-type dengue PRM15/tE-polypeptide encoding sequence selected from any of SEQ ID NOS:149-152 and similar known dengue virus PRM15/tE polypeptide-encoding nucleotide sequences, including those available in GenBank, and more preferably as compared to any known wild-type flaviviral PRM15/tE polypeptide-encoding nucleotide sequence. Such fragment induces an immune response in a subject to at least one dengue virus of at least one serotype and is useful in, among other things, methods of inducing an immune response in a subject or population of the subject's cells against at least one dengue virus of at least one serotype (e.g., neutralizing Ab response against one or more viruses of one or more dengue virus serotypes).

In another aspect, the invention provides a fragment of a nucleic acid of the invention that comprises a unique sequence of nucleotides of at least about 300, preferably at least about 400, more preferably at least about 600, desirably at least about 900, and more desirably at least about 1200 nucleotides from a sequence selected from any of SEQ ID NOS:201-210, 211-218, 254-271, 342, and 344, as compared to a non-codon optimized WT C15/full prM/full E-polypeptide encoding sequence selected from any of SEQ ID NOS:227-230 and a known dengue virus C15/full prM/full E polypeptide-encoding nucleotide sequences, including those available in GenBank, and more preferably as compared to any known WT flaviviral C15/full prM/full E polypeptide-encoding nucleotide sequence. Such fragment induces an immune response in a subject or population of its cells to at least one dengue virus of at least one serotype and is useful in, among other things, methods of inducing an immune response in such subject or cells against at least one dengue virus of at least one serotype (e.g., neutralizing Ab response against one or more serotypes).

The invention also provides a nucleic acid that selectively hybridizes to at least one of SEQ ID NOS:285-330, or the complement thereof, than as compared to a wild-type dengue tE-polypeptide encoding sequence selected from any of SEQ ID NOS:338-341 and a known dengue virus truncated E polypeptide-encoding nucleotide sequences, including those in GenBank, and more preferably as compared to any known wild-type flaviviral truncated E polypeptide-encoding nucleotide sequence, or the complement thereof, as applicable.

The invention also provides a nucleic acid that selectively hybridizes to at least one of SEQ ID NOS:156-200, 211-214, and 235, or the complement thereof, as compared to a wild-type dengue PRM15/tE-polypeptide encoding sequence selected from any of SEQ ID NOS:149-152 and similar known dengue virus PRM15/tE polypeptide-encoding nucleotide sequences, including those available in GenBank, and more preferably as compared to any known wild-type flaviviral PRM15/tE polypeptide-encoding nucleotide sequence, or the complement thereof, as applicable.

The invention also provides a nucleic acid that selectively hybridizes to at least one of SEQ ID NOS:201-210, 215-218, 254-271, 342, and 344, or the complement thereof, as compared to a non-codon optimized wild-type dengue C15/full prM/full E-polypeptide encoding sequence selected from any of SEQ ID NOS:227-230 and a known dengue virus C15/full prM/full E polypeptide-encoding nucleotide sequences, including those available in GenBank, and more preferably as compared to any known wild-type flaviviral C15/full prM/full E polypeptide-encoding nucleotide sequence, as applicable.

The invention further provides a composition and/or a nucleic acid obtained by cleaving a nucleic acid of the invention. The nucleic acid can be cleaved by mechanical, chemical, or enzymatic cleavage. Techniques for cleavage of nucleic acids are known in the art. Cleavage by enzymatic cleavage, particularly endonuclease, exonuclease digestion, RNAse digestion, or DNAse (e.g., benzon nuclease, such as Benzonase®) digestion of the nucleic acid. The composition also can comprise the products of cleaving multiple nucleic acids of the invention by such techniques.

The invention further provides a composition and/or nucleic acid produced by a process that comprises incubating a nucleic acid of the invention in the presence of nucleotide triphosphates (NTPs, preferably dNTPs) and a nucleic acid polymerase. Typically, and preferably, the polymerase is a thermostable polymerase, such as a Taq polymerase.

In another aspect, the invention provides a library or pool of non-identical nucleic acids of the invention and/or a library of nucleic acids comprising at least one nucleic acid of the invention. For example, the invention provides a library of nucleic acids comprising at least one nucleic acid having substantial identity (e.g., at least about 75%, 80%, 85%, 86%, 87%, 88% or 89%, preferably at least about 90%, 91%, 92%, 93%, or 94%, and more preferably at least about 95% (e.g., about 87-95%), 96% 97%, 98%, 99%, 99.5% sequence identity) with at least one polynucleotide sequence selected from the group of SEQ ID NOS:156-218, 235, 254-271, 285-330, 342, and 344.

In another context, the invention provides a library of non-identical nucleic acids that have substantial identity (e.g., at least about 75%, 80%, 85%, 86%, 87%, 88% or 89%, preferably at least about 90%, 91%, 92%, 93%, or 94%, and more preferably at least about 95% (e.g., about 87-95%), 96% 97%, 98%, 99%, 99.5% sequence identity) with at least one sequence selected from SEQ ID NOS:156-218, 235, 254-271, 285-330, 342, and 344. For example, the library in either case might be a library obtained by the above-described recursive sequence recombination techniques.

Thus, for example, the invention provides a composition comprising a library of nucleic acids obtained by a method comprising recombining at least a first nucleic acid comprising a sequence selected from SEQ ID NOS:211-214 and/or 215-218, and at least a second nucleic acid, wherein the first and second nucleic acids differ from each other in two or more nucleotides, to produce a library of recombinant or synthetic nucleic acids. The invention also provides nucleic acids produced by similar recombination reactions using any nucleic acid of the invention.

The invention also provides a method of recombination comprising subjecting at least one nucleic acid sequence of the invention to recursive sequence recombination with at least one additional nucleic acid, as described above. Regardless of how the library or pool is produced, the library or pool can comprises any suitable number of nucleic acid species therein. For example, the library can comprise at least about 2, 5, 10, 50 or more non-identical nucleic acids of the invention. The library can be inserted into one or more cells, e.g., by library transfection techniques. Such a population of cells is also contemplated.

The invention also provides a composition comprising such a library. For example, a library of nucleic acids as described above can be used for diagnosis of gene expression (e.g., by way of “gene chip” technology well-known in the art). Such libraries can be subject to expression, hybridization, or any other form of analysis for any suitable diagnostic purpose.

The invention also provides a method of producing a modified nucleic acid comprising mutating a nucleic acid of the invention (e.g., a nucleic acid selected from the group of SEQ ID NOS:156-218, 235, 254-271, 285-330, 342, and 344). The invention further provides a modified nucleic acid produced by this method. Methods for mutating nucleic acids of the invention are described elsewhere herein.

The polynucleotide of the invention can be in the form of an aptamer (as described in, e.g., Famulok and Mayer—“http” colon, slash, slash, “www” dot “chemie” dot “uni-bonn” dot “de” slash “oc” slash “ak_fa” slash “publications” slash “CTMI-paper” dot “pdf” capable of binding to suitable targets. The nucleic acids of the invention also can be used to form triplex-forming inhibitory nucleotides. The nucleic acid also can be conjugated to a DNA binding domain, such that they silence gene expression of undesired genes (e.g., act as gene decoys).

The invention also provides protein mimetics of the polypeptides of the invention. Peptide mimetics are described in, e.g., U.S. Pat. No. 5,668,110 and the references cited therein. Furthermore, the fusion protein can be modified by the addition of protecting groups to the side chains of one or more the amino acids of the fusion protein. Such protecting groups can facilitate transport of the fusion peptide through membranes, if desired, or through certain tissues, for example, by reducing the hydrophilicity and increasing the lipophilicity of the peptide. Examples of suitable protecting groups include ester protecting groups, amine protecting groups, acyl protecting groups, and carboxylic acid protecting groups, which are known in the art (see, e.g., U.S. Pat. No. 6,121,236). Synthetic fusion proteins of the invention can take any suitable form. For example, the fusion protein can be structurally modified from its naturally occurring configuration to form a cyclic peptide or other structurally modified peptide. The polypeptide of the invention also can be linked to one or more nonproteinaceous polymers, typically a hydrophilic synthetic polymer, e.g., polyethylene glycol (PEG), polypropylene glycol, or polyoxyalkylene, as described in, e.g., U.S. Pat. Nos. 4,179,337, 4,301,144, 4,496,689, 4,640,835, 4,670,417, and 4,791,192, or a similar polymer such as polyvinylalcohol or polyvinylpyrrolidone (PVP). As discussed above, the polypeptide can be subject to common protein modifications, such as carboxylation, glycosylation, hydroxylation, lipid or lipid derivative-attachment, methylation, myristylation, phosphorylation, and sulfation. Other post-translational modifications include acetylation, acylation, ADP-ribosylation, amidation, covalent attachment of flavin, covalent attachment of a heme moiety, covalent attachment of a nucleotide or nucleotide derivative, covalent attachment of phosphotidylinositol, cross-linking, cyclization, disulfide bond formation, demethylation, formylation, GPI anchor formation, iodination, oxidation, proteolytic processing, prenylation, racemization, selenoylation, arginylation, and ubiquitination. Other common protein modifications are described in, e.g., Creighton, supra, Seifter et al. (1990) Meth Enzymol 18:626-646, and Rattan et al. (1992) Ann NY Acad Sci 663:48-62. Such modifications are usually the result of post-translational modifications that occur in recombinant polypeptides of the invention (alternatively, such modifications can be carried out synthetically). Post-translational modifications for polypeptides expressed from nucleic acids in host cells vary depending what kind of host or host cell type the peptide is expressed in. For instance, glycosylation often does not occur in bacterial hosts such as E. coli and varies considerably in baculovirus systems as compared to mammalian cell systems. Accordingly, when glycosylation is desired, a polypeptide should be expressed (produced) in a glycosylating host, generally a eukaryotic cell (e.g., a mammalian cell or an insect cell). Additional and particularly preferred protein modifications are discussed elsewhere herein. Modifications to the polypeptide can be verified by any suitable technique, including, e.g., x-ray diffraction, NMR imaging, mass spectrometry, and/or chromatography (e.g., reverse phase chromatography, affinity chromatography, or GLC).

The polypeptide also or alternatively can comprise any suitable number of non-naturally occurring amino acids (e.g., β amino acids) and/or alternative amino acids (e.g., selenocysteine), or amino acid analogs, such as those listed in the Manual of Patent Examining Procedure §2422 (7th Revision—2000), which can be incorporated by protein synthesis, such as through solid phase protein synthesis (as described in, e.g., Merrifield (1969) Adv Enzymol 32:221-296 and other references cited herein).

Recently, the production of fusion proteins comprising a prion-determining domain has been used to produce a protein vector capable of non-Mendelian transmission to progeny cells (see, e.g., Li et al., J. Mol. Biol., 301(3), 567-73 (2000)). The inclusion of such prion-determining sequences in a fusion protein comprising immunogenic amino acid sequences of the invention is contemplated, ideally to provide a hereditable protein vector comprising the fusion protein that does not require a change in the host's genome.

The invention also provides a polypeptide which comprises an amino acid sequence of at least about 45 amino acids in length, preferably at least about 55 amino acids in length, and more preferably at least about 80 amino acids in length, corresponding to a fragment of a polypeptide of any one of SEQ ID NOS:139-148, 236-253, 343, and 345, wherein the amino acid sequence is unique as compared to a polypeptide encoded by any of SEQ ID NOS:215-217, and known dengue virus C15/full prM/full E polypeptides. In addition, the invention provides a polypeptide which comprises an amino acid sequence of at least about 45 amino acids in length, preferably at least about 55 amino acids in length, and more preferably at least about 80 amino acids in length, corresponding to a fragment of a polypeptide of any one of SEQ ID NOS:156-200 and 235, wherein the amino acid sequence is unique as compared to a polypeptide encoded by any of SEQ ID NOS:211-214, and known dengue virus PRM15/tE polypeptides.

In another aspect, the invention provides a polypeptide which is specifically bound by polyclonal antisera raised against at least one antigen, the at least one antigen comprising an amino acid sequence selected from the group of SEQ ID NOS:1-49 and 153-155, or an antigenic or immunogenic fragment thereof, wherein said antigenic or immunogenic polypeptide fragment induces an immune response in a subject against at least one dengue virus of at least one virus serotype that is about equal to or greater than the immune response induced in the mammalian cell by a antigenic or immunogenic polypeptide fragment of the at least one dengue virus of the at least one serotype; wherein the polyclonal antisera is subtracted with at least one of (1) a truncated envelope protein selected from the group of SEQ ID NOS:338-341 and (2) a truncated envelope protein comprising an amino acid sequence fragment of a known dengue virus truncated E polypeptide, wherein said amino acid sequence fragment has a length substantially identical (e.g., having at least about 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96% 97%, 98%, 99%, 99.5% sequence identity) to the truncated E protein of any of SEQ ID NOS:338-341.

The invention also includes an antibody or antisera that specifically binds a polypeptide, the polypeptide comprising an amino acid sequence selected from the group of SEQ ID NOS:1-49 and 153-155, wherein the antibody or antisera does not specifically bind to a polypeptide comprising one or more of: the polypeptides of SEQ ID NOS:338-341 and known dengue virus truncated E proteins.

Further provided is a recombinant or synthetic polypeptide which is specifically bound by a polyclonal antisera raised against at least one antigen, the antigen comprising an amino acid sequence selected from the group of SEQ ID NOS:65-116, or a fragment thereof, wherein the antisera is subtracted with polypeptides encoded by SEQ ID NOS:149-152, and known dengue virus PRM15/tE polypeptides.

Further provided is a recombinant or synthetic polypeptide which is specifically bound by a polyclonal antisera raised against at least one antigen, the antigen comprising an amino acid sequence selected from the group of SEQ ID NOS:139-148, 236-253, 343, and 345, or a fragment thereof, wherein the antisera is subtracted with polypeptides encoded by SEQ ID NOS:227-230, and known dengue virus C15/full prM/full E polypeptides.

In general, the polypeptides of the invention provide structural features that can be recognized, e.g., in immunological assays. The production of antisera comprising at least one antibody (for at least one antigen) that specifically binds a polypeptide of the invention, and the polypeptides which are bound by such antisera, are features of the invention. Binding agents, including antibodies described herein, may bind the dengue antigen polypeptides of the invention and/or fragments thereof with affinities of about 1×10² M⁻¹ to about 1×10¹⁰ M⁻¹ (i.e., about 10⁻²-10⁻¹⁰ M) or greater, including about 10⁴ to 10⁶ M⁻¹, about 10⁶ to 10⁷ M⁻¹, or about 10⁸ M⁻¹ to 10⁹ M⁻¹ or 10¹⁰ M⁻¹. Conventional hybridoma technology can be used to produce antibodies having affinities of up to about 10⁹ M⁻¹. However, new technologies, including phage display and transgenic mice, can be used to achieve higher affinities (e.g., up to at least about 10¹² M⁻¹). In general, a higher binding affinity is advantageous.

In order to produce antiserum or antisera for use in an immunoassay, at least one immunogenic polypeptide (or polypeptide-encoding polynucleotide) of the invention is produced and purified as described herein. For example, recombinant polypeptide may be produced in a mammalian cell line. Alternatively, an inbred strain of mice can immunized with the immunogenic protein(s) in combination with a standard adjuvant, such as Freund's adjuvant or alum, and a standard mouse immunization protocol (see Harlow and Lane, supra, for a standard description of antibody generation, immunoassay formats and conditions that can be used to determine specific immunoreactivity). Alternatively, at least one synthetic or recombinant polypeptide derived from at least one polypeptide sequence disclosed herein or expressed from at least one polynucleotide sequence disclosed herein can be conjugated to a carrier protein and used as an immunogen for the production of antiserum. Polyclonal antisera typically are collected and titered against the immunogenic polypeptide in an immunoassay, for example, a solid phase immunoassay with one or more of the immunogenic proteins immobilized on a solid support. In the above-described methods where novel antibodies and antisera are provided, antisera resulting from the administration of the polypeptide (or polynucleotide and/or vector) with a titer of about 10⁶ or more typically are selected, pooled and subtracted with the control co-stimulatory polypeptides to produce subtracted pooled titered polyclonal antisera.

Cross-reactivity of antibodies can be determined using standard techniques, such as competitive binding immunoassays and/or parallel binding assays, and standard calculations for determining the percent cross-reactivity. Usually, where the percent cross-reactivity is at least 5-10× as high for the test polypeptides, the test polypeptides are said to specifically bind the pooled subtracted antisera or antibody.

Antisera raised or induced by an immunizing antigen may bind related antigens (e.g., cross-react). In such instance, the cross-reacting antigens comprise the same or substantially equivalent epitopes or comprise epitopes that are, e.g., sufficiently similar in shape to bind the same antibody.

The invention also provides polynucleotide consensus sequences derived from a comparison of two or more of the polynucleotide sequences described herein (e.g., a consensus sequence obtained by comparison of two or more sequences selected from, e.g., SEQ ID NOS:156-218, 235, 254-271, 285-330, 342, and 344). Preferably, the nucleic acid provides a non-naturally-occurring or recombinant polynucleotide comprising a sequence obtained by selection of nucleic acids from such a consensus sequence. The invention also provides a polypeptide consensus sequence obtained by similar analysis of at least two polypeptides of the invention (e.g., by comparison of any two amino acids selected from the group of SEQ ID NOS:1-49, 65-116, 139-148, 153-155, 236-253, 343, and 345). The invention further provides a polypeptide comprising a sequence according to the sequence pattern (formula) obtained by such consensus sequence analysis. Examples of such formulas are described herein, such as the one obtained by standard alignment of the polypeptides.

The invention also provides polypeptides and/or polynucleotides that comprise a sequence that has substantial local sequence identity, in contrast to global/overall sequence identity (discussed primarily herein), to one of the sequences specifically disclosed herein. Local sequence identity can be determined using local sequence alignment software, e.g., the BLAST programs described above, the LFASTA program, or, more preferably, the LALIGN program. Preferably, the LALIGN program using a BLOSUM50 matrix analysis is used for amino acid sequence analysis, and a +5 match/−4 mismatch analysis is used for polynucleotide sequence analysis. Gap extension and opening penalties are preferably the same as those described above with respect to analysis with the ALIGN program. For LALIGN (or other program) analysis using k-tup value settings (also referred to as “k-tuple” or ktup values), a k-tup value of 0-3 for proteins, and 0-10 (e.g., about 6) for nucleotide sequences, is preferred. The invention provides in this respect, for example, a polynucleotide that comprises a sequence that has substantial local identity (i.e., a percent identity similar to the percent identities discussed with respect to substantially identical sequences) (e.g., having at least about 75%, 80%, 85%, 86%, 87%, 88% or 89%, preferably at least about 90%, 91%, 92%, 93%, or 94%, and more preferably at least about 95% (e.g., about 87-95%), 96% 97%, 98%, 99%, 99.5% sequence identity) over a sequence of at least about 300, desirably at least about 450, more desirably at least about 600, preferably at least about 900, and more preferably at least about 1200 nucleotides, with an immunogenic amino acid-encoding portion of a nucleic acid of the invention, despite lacking substantial identity due to gap penalties and/or differences in length of the analyzed sequences. Similarly, the invention provides a polypeptide that comprises an amino acid sequence of at least about 10, preferably at least about 20, more preferably at least about 50, favorably at least about 100, more favorably at least about 200, or more (e.g., at least about 300, 350, or 375) amino acid residues that has substantial local identity with an immunogenic amino acid sequence of one of the recombinant polypeptides of the invention (e.g., a recombinant tE sequence, full E sequence, PRM15/tE sequence, C15/full prM/tE sequence, C15/full prM/full E sequence, etc.).

A recombinant polypeptide of the invention desirably, though not necessarily, exhibits primary, secondary, and/or tertiary structural similarity to a wild-type dengue virus polypeptide, or at least a fragment thereof of similar length to the recombinant polypeptide. For example, a recombinant tE, full E, PRM15/tE, C15/full prM/tE, or C15/full prM/full E polypeptide may exhibit primary, secondary, and/or tertiary structural similarity to a wild-type dengue virus polypeptide tE, full E, PRM15/tE, C15/full prM/tE, or C15/full prM/full E polypeptide, respectively. Structural similarity can be determined by any suitable technique, preferably using a suitable software program for making such assessments. Examples of such programs include the MAPS program and the TOP program (described in Lu, Protein Data Bank Quarterly Newsletter, #78, 10-11 (1996), and Lu, J. Appl. Cryst., 33, 176-183 (2000)). The polypeptide and/or prM/E protein desirably exhibit topological diversity in such contexts (e.g., a topical diversity of less than about 20, preferably less than about 15, and more preferably less than about 10), or both. However, polypeptides having high structural diversity can be suitable and even preferred. Such diverse, but functional, peptides can be obtained by recursive sequence recombination techniques described above. Alternatively, the structure of proteins can be compared using the PROCHECK program (described in, e.g., Laskowski, J. Appl. Cryst., 26, 283-291 (1993)), the MODELLER program, or commercially available programs incorporating such features. Alternatively still, structure predictions can be compared by way of a sequence comparison using a program such as the PredictProtein server (available at “http” colon, slash. slash, “dodo” dot “cpmc” dot “Columbia” dot “edu” slash “predictprotein” slash). Additional techniques for analyzing protein structure that can be applied to determine structural similarity are described in, e.g., Yang and Honig, J. Mol. Biol., 301(3), 665-78 (2000), Aronson et al., Protein Sci., 3(10), 1706-11 (1994), Marti-Remon et al., Annu. Rev. Biophys. Biomol. Struct., 29, 291-325 (2000), Halaby et al., Protein Eng., 12(7), 563-71 (1999), Basham, Science, 283, 1132 (1999), Johnston et al., Crit. Rev. Biochem. Mol. Biol., 29(J), 1-68 (1994), Moult, Curr. Opin. Biotechnol., 10(6), 583-6 (1999), Benner et al., Science, 274, 1448-49 (1996), and Benner et al., Science, 273, 426-8 (1996), as well as Int'l Patent Application WO 00/45334.

Kits of the invention optionally comprise at least one of the following of the invention: (1) an apparatus, system, system component, or apparatus component as described herein; (2) at least one kit component comprising at least one polypeptide, polynucleotide (or fragment of either thereof), vector, antibody, and/or cell of the invention; one or more cells comprising at least one polypeptide, polynucleotide, vector, and/or antibody of the invention; a cell expressing a polypeptide of the invention; a composition, pharmaceutical composition, or vaccine composition (composition suitable for use as a vaccine in mammals) comprising at least one of any component above (e.g., polypeptide, polynucleotide, vector, antibody, and/or cell of the invention) or any combination thereof; (3) instructions for practicing any method described herein, including methods of inducing immune response, methods of immunizing, methods of detecting or diagnosing the presence or one or more antibodies to at least one dengue virus of one or more serotypes in a biological sample, therapeutic or prophylactic methods, instructions for using any component identified in (2) or any vaccine or composition of any such component; and/or instructions for operating any apparatus, system or component described herein; (4) a container for holding said at least one such component or composition, and (5) packaging materials. In a further aspect, the present invention provides for the use of any apparatus, component, composition, or kit described above and herein, for the practice of any method or assay described herein, and/or for the use of any apparatus, component, composition, or kit to practice any assay or method described herein.

The invention provides computers, computer readable media, and integrated systems comprising character strings corresponding to the sequence information herein for the polypeptides and nucleic acids herein, including, e.g., those sequences listed herein and various silent substitutions and conservative substitutions thereof. Various methods and genetic algorithms (GAs) known in the art can be used to detect homology or similarity between different character strings, or can be used to perform other desirable functions such as to control output files, provide the basis for making presentations of information including the sequences and the like. Examples include BLAST, discussed supra.

Different types of homology and similarity of various stringency and length can be detected and recognized in the integrated systems herein. E.g., many homology determination methods have been designed for comparative analysis of sequences of biopolymers, for spell-checking in word processing, and for data retrieval from various databases. With an understanding of double-helix pair-wise complement interactions among 4 principal nucleobases in natural polynucleotides, models that simulate annealing of complementary homologous polynucleotide strings can also be used as a foundation of sequence alignment or other operations typically performed on the character strings corresponding to the sequences herein (e.g., word-processing manipulations, construction of figures comprising sequence or subsequence character strings, output tables, etc.). An example of a software package with GAs for calculating sequence similarity is BLAST, which can be adapted to the invention by inputting character strings corresponding to the sequences herein.

Similarly, standard desktop applications such as word processing software (e.g., Microsoft Word™ or Corel WordPerfect™) and database software (e.g., spreadsheet software such as Microsoft Excel™, Corel Quattro Pro™, or database programs such as Microsoft Access™ or Paradox™) can be adapted to the present invention by inputting a character string corresponding to the polypeptides or polynucleotides of the invention or both, or fragments of either. For example, the integrated systems can include the foregoing software having the appropriate character string information, e.g., used in conjunction with a user interface (e.g., a GUI in a standard operating system such as a Windows, Macintosh or LINUX system) to manipulate strings of characters. As noted, specialized alignment programs such as BLAST can also be incorporated into the systems of the invention for alignment of nucleic acids or proteins (or corresponding character strings).

Integrated systems for analysis in the present invention typically include a digital computer with GA software for aligning sequences, as well as data sets entered into the software system comprising any of the sequences described herein. The computer can be, e.g., a PC (Intel x86 or Pentium chip-compatible DOS™, OS2™ WINDOWS™ WINDOWS NT™, WINDOWS95™, WINDOWS98™ LINUX based machine, a MACINTOSH™, Power PC, or a UNIX based (e.g., SUN™ work station) machine) or other commercially common computer which is known to one of skill. Software for aligning or otherwise manipulating sequences is available, or can easily be constructed by one of skill using a standard programming language such as Visualbasic, Fortran, Basic, Java, or the like.

Any controller or computer optionally includes a monitor which is often a cathode ray tube (“CRT”) display, a flat panel display (e.g., active matrix liquid crystal display, liquid crystal display), or others. Computer circuitry is often placed in a box which includes numerous integrated circuit chips, such as a microprocessor, memory, interface circuits, and others. The box also optionally includes a hard disk drive, a floppy disk drive, a high capacity removable drive such as a writeable CD-ROM, and other common peripheral elements. Inputting devices such as a keyboard or mouse optionally provide for input from a user and for user selection of sequences to be compared or otherwise manipulated in the relevant computer system.

The computer may include appropriate software for receiving user instructions, either in the form of user input into a set parameter fields, e.g., in a GUI, or in the form of preprogrammed instructions, e.g., preprogrammed for a variety of different specific operations. The software then converts these instructions to appropriate language for instructing the operation of the fluid direction and transport controller to carry out the desired operation. The software can also include output elements for controlling nucleic acid synthesis (e.g., based upon a sequence or an alignment of a sequence herein) or other operations which occur downstream from an alignment or other operation performed using a character string corresponding to a sequence herein.

In a particular aspect of such an embodiment of the invention, the invention provides a computer or computer readable medium comprising a database comprising a sequence record comprising one or more character string corresponding to a nucleic acid sequence selected from the group of SEQ ID NOS:156-218, 235, 254-271, 285-330, 342, and 344, or a polypeptide sequence selected from the group of SEQ ID NOS:1-49, 65-116, 139-148, 153-155, 236-253, 343, and 345.

The invention provides an integrated system comprising a computer or computer readable medium comprising a database comprising at least one sequence record, each comprising at least one character string corresponding to a nucleic acid or protein sequence selected from any of SEQ ID NOS:1-49, 65-116, 139-148, 153-218, 235-253, 254-271, 285-330, 342-345, the integrated system further comprising a user input interface allowing a user to selectively view one or more sequence records. For some such integrated systems, the computer or computer readable medium comprising an alignment instruction set which aligns the character strings with at least one additional character string corresponding to a nucleic acid or protein sequence. The instruction set may comprise one or more of: a local homology comparison determination, a homology alignment determination, a search for similarity determination, and a BLAST determination. Some such systems may also comprise a user readable output element that displays an alignment produced by the alignment instruction set.

In some aspects, the computer or computer readable medium further comprises an instruction set which translates at least one nucleic acid sequence comprising a sequence selected from the group of SEQ ID NOS:156-218, 235, 254-271, 285-330, 342, and 344 into an amino acid sequence. In other aspects, the computer or computer readable medium further comprising an instruction set for reverse-translating at least one amino acid sequence comprising a sequence selected from SEQ ID NOS:149, 65-116, 139-148, 153-155, 236-253, 343, and 345 into a nucleic acid sequence. For some such systems, the instruction set selects the nucleic acid sequence by applying a codon usage instruction set or an instruction set which determines sequence identity to a test nucleic acid sequence.

Also provided is a method of using a computer system to present information pertaining to at least one of a plurality of sequence records stored in a database, each of said sequence records each comprising at least one character string corresponding to one or more of SEQ ID NOS:1-49, 65-116, 139-148, 153-218, 235-253, 254-271, 285-330, 342-345, the method comprising: determining a list of one or more character strings corresponding to one or more of said above-referenced SEQ ID NOS, or a subsequence thereof; determining which one or more character strings of said list are selected by a user; and displaying the selected character strings, or aligning the selected character strings with an additional character string. Some such methods further comprise displaying an alignment of the selected character string with the additional character string and/or displaying the list.

In a further aspect, the invention provides a method of generating and/or selecting a polypeptide variant comprising using a character string corresponding to at least one of SEQ ID NOS:1-49, 65-116, 139-148, 153-155, 236-253, 343, and 345, preferably in combination with additional biological functional information (e.g., neutralizing antibody titer against one or more dengue virus serotypes) to an algorithm, preferably facilitated by a computer media, the analyzed outcome of which generates one or more amino acid sequences, pattern of sequence characteristics, sequence changes (mutations), and/or structures that are suggestive of a polypeptide that exhibits a similar and/or improved biological property of such polypeptides. The method can include subjecting the character strings to any suitable type of genetic modeling or algorithm known in the art, including, e.g., statistical analysis techniques such as Markov modeling, principal component analysis, neural network analysis, random recombination-modeling approaches, and physical recombination approaches. Examples of such techniques are described in, e.g., International Patent Applications WO 01/83559, WO 99/49893, and WO 01/61344, and U.S. Pat. No. 6,269,312. Additional techniques and principles are described in, e.g., International Patent Applications WO 00/42561, WO 01/51663, and WO 01/90197 as well as Norton et al. Virus Res 55(1):37-48 (1998), Rappuoli, Curr Opin Microbiol 3(5):445-450 (2000), Petersen et al., Scand J. Immunol 53(4):357-364 (2001), and Nakai J. Struct Biol 134(2-3): 103-116 (2001). Amino acid and nucleotide sequences produced having a non-wild-type sequence generated by such in silico modeling using a character string corresponding to SEQ ID NOS:1-49, 65-116, 139-148, 153-218, 235-253, 254-271, 285-330, 342-345 are a feature of the invention.

Any of the above described features of the polypeptides, polynucleotides, vectors, cells, compositions, and methods of the invention can be combined in any suitable manner, unless otherwise stated or clearly contradicted by context.

Any molecule of the invention, including any nucleic acid, polypeptide, protein, peptide, or fusion protein of the invention, or any vector, cell, or composition comprising any such molecule as described herein, can be used in any of the methods and applications of the invention described herein. In one aspect, the invention provides for the use of any such molecule, including any nucleic acid, polypeptide, protein, peptide, or fusion protein, or any vector, cell, or composition comprising any such molecule as described herein, as a medicament, drug, therapeutic or prophylactic agent, or vaccine, for the treatment or prevention of a disease or disorder, including those diseases and disorders described herein (e.g., those related to dengue virus infection), or the like. In another aspect, the invention provides for the use of any such molecule (e.g., any nucleic acid, polypeptide, protein, fusion protein, or peptide of the invention) or any vector, cell, or composition comprising any such molecule, for the manufacture of a medicament, prophylactic or therapeutic agent, drug, or vaccine, for use in any applicable therapeutic or prophylactic method for the treatment or prevention of a disease or disorder, including those described herein (e.g., those related to dengue virus infection).

EXAMPLES

The following examples further illustrate the invention, but should not be construed as in any way limiting its scope in any way.

Example 1

This example illustrates the generation and identification of novel nucleic acids that encode recombinant, synthetic or mutant dengue virus antigens, polypeptides comprising such dengue virus antigens, the construction of an exemplary DNA vectors for delivery of such nucleic acids to mammalian cells, and the expression of such nucleic acids by transfection of mammalian cells with such a vector, resulting in high levels of expression and secretion of the encoded dengue antigens.

A. Synthesis of Novel Dengue Antigen-Encoding Nucleotide Sequences

The amino acid sequences of the various proteins of each of the four WT DEN-1, DEN-2, DEN-3, and DEN-4 viruses were analyzed to identify the following regions of the polyprotein of each of the four WT dengue virus serotypes: (1) the amino acid segment or fragment of the dengue polyprotein sequence corresponding to the C-terminal 15 amino acids of the prM protein (“PRM15”), which 15 amino acid segment of the prM protein may serve as a signal sequence for the dengue envelope (E) protein; (2) the amino acids comprising most of the full length E protein sequence of the dengue polyprotein sequence, i.e., excluding those amino acid residues corresponding to approximately 5% to about 12% of the C-terminal region of the E protein sequence that encode a portion of the hydrophobic region). Thus, an amino acid sequence was identified from the polyprotein sequence of each of the four WT dengue virus serotypes; each such amino acid sequence comprised a fusion protein comprising the C-terminal 15 amino acid residues of the prM protein (e.g., 15 amino acid residues of the M protein) and approximately 90-95% of the N-terminal amino acid residues of the E protein (hence, a termed a “truncated” E protein). See FIG. 14. These amino acid sequences identified from each of the wild-type DEN-1, DEN-2, DEN-3, and DEN-4 polyproteins were termed Den-1 PRM15/trunc E (or Den-1PRM15/tE) (SEQ ID NO:149), Den-2PRM15/trunc E (or Den-2PRM15/tE) (SEQ ID NO:150), Den-3PRM15/trunc E (or Den-3PRM15/tE) (SEQ ID NO:151), and Den-4PRM15/trunc E (or Den-4PRM15/tE) (SEQ ID NO:152) dengue antigen fusion proteins, respectively. Collectively, these sequences may be termed the “truncated” parental polypeptides. These fusion proteins can be made by expression from plasmid vectors, such as E. coli vectors, viral vectors, baculovirus vectors, or other plasmid vectors, and produced in insect cells, E. coli cells, or mammalian cell cultures by standard techniques known in the art and described herein. Alternatively, proteins can be assembled from protein fragments or peptide fragments made by standard protein synthesis techniques well known in the art and discussed above.

Each of these four identified amino acid sequences was back translated to a nucleotide sequence, using a standard human codon frequency table (“http” colon, slash, slash, “www” dot “kazusa” dot “orp” slash “codon” slash “cgi-bin” slash “showcodon” dot “cgi?species=Homo+sapiens+[gbpri]”), to obtain a human codon optimized DNA sequence.

DNA oligonucleotides comprising overlapping portions of each identified human codon optimized PRM15 and truncated E fusion protein-encoding sequence were synthesized using standard techniques for each of the 4 dengue virus serotypes. For each of the virus serotypes, the overlapping oligos that collectively made up the PRM15/E truncated protein-encoding sequence were permitted to hybridize. DNA sequences corresponding to the codon optimized sequences were produced by standard PCR gene synthesis using the hybridized oligos as templates. For each of the PCR gene synthesis reactions, a 5′ forward primer comprising nucleotide sequences overlapping the PRM15 sequence, an additional BamHI site and a 5′-ACC-3′ Kozak consensus sequence (Cell 15:1109-23 (1978)) and a 3′ reverse primer comprising nucleotide sequences overlapping the truncated E gene and a EcoRI restriction site were used, such that these sites and the consensus sequence were added to the resulting PRM15/truncated E-encoding PCR products. The resulting human codon optimized PRM15/truncated E-encoding DNA sequences were designated Den-1PRM15/truncE CO (or Den-1PRM15/tE CO) (SEQ ID NO:211), Den-2PRM15/truncE CO (or Den-2PRM15/tE CO) (SEQ ID NO:212), Den-3PRM15/truncE CO (or Den-3PRM15/tE CO (SEQ ID NO:213), and Den-4PRM15/truncE CO (or Den-4PRM15/tE CO) (SEQ ID NO:214), respectively, where “PRM15” refers to a signal nucleotide sequence comprising a sequence encoding the 15 amino acids of the C terminus of prM (i.e., PRM15) and typically an additional methionine residue (which is the first amino acid residue before the 15 amino acid sequence), “trunc” E or “tE” refers to the nucleotide sequence encoding a truncated E protein, and “CO” refers to codon optimized. Collectively, these nucleotide sequences may be termed the truncated parental nucleotide sequences, since they were used as parental nucleic acids in the recursive sequence recombination methods described herein.

Similar procedures are used to identify and make a polypeptide sequence comprising: the 15 amino acid residues of the capsid (C) protein of a particular WT dengue virus serotype polyprotein (e.g., Den-1, Den-2, Den-3, or Den-4), the amino acid sequence corresponding to the full length prM protein of said WT dengue virus polyprotein, and the amino acid sequence corresponding to said full length E protein of the WT dengue virus polyprotein. Such procedures are conducted using each of the four WT dengue virus polyproteins, and the resulting polypeptide sequences are termed Den-1 C15/full prM/full E (or Den-1 C15/prM/E) (SEQ ID NO:227), Den-2 C15/full prM/full E (or Den-2 C15/prM/E) (SEQ ID NO:228), Den-3 C15/full prM/full E (or Den-3 C15/prM/E) (SEQ ID NO:229), Den-4 C15/full prM/full E (or Den-4 C15/prM/E) (SEQ ID NO:230) dengue antigen fusion proteins, respectively.

Human codon optimized sequences that encode each of these dengue antigen fusion proteins are determined and made as described above, and the resulting nucleotide sequences are termed: Den-1 C15/full prM/full E CO (or Den-1 C15/prM/E CO) (SEQ ID NO:231), Den-2 C15/full prM/full E CO (or Den-2 C15/prM/E CO) (SEQ ID NO:232), Den-3 C15/full prM/full E CO (or Den-3 C15/prM/E CO) (SEQ ID NO:233), and Den-4 C15/full prM/full E CO (or Den-4 C15/prM/E CO) (SEQ ID NO:234) dengue antigen nucleotide sequences, respectively.

B. Construction of pMaxVax10.1

An exemplary mammalian expression vector termed “pMaxVax10.1” (see FIG. 1) comprises, among other things: (1) a promoter for driving the expression of a transgene (or other nucleotide sequence) in a mammalian cell (including, e.g., but not limited to, a CMV promoter or a variant thereof, and shuffled, synthetic, or recombinant promoters, including those described in PCT application having International Publication No. WO 02/00897; (2) a polylinker for cloning of one or more transgenes (or other nucleotide sequence); (3) a polyadenylation signal (e.g., polyA sequence); and (4) a prokaryotic replication origin and antibiotic resistant gene for amplification in E. coli. The construction of the vector is briefly described herein, although several suitable alternative techniques are available to produce such a DNA vector (e.g., applying the principles described elsewhere herein).

In one embodiment, the minimal plasmid Col/Kana comprises the replication origin ColE1 and the kanamycin resistance gene (Kana^(r)). The ColE1 origin of replication (ori) mediates high copy number plasmid amplification.

In one embodiment, the ColE1 ori was isolated from vector pUC19 (New England Biolabs, Inc.) by application of standard PCR techniques. To link the ColE1 origin to the Kana^(r) gene, NgoMIV (or “NgoMI”) and DraIII recognition sequences were added to the 5′ and 3′ PCR primers, respectively. NgoMIV and DraIII are unique cloning sites in pMaxVax10.1. For subsequent cloning of the mammalian transcription unit, the 5′ forward primer also was designed to include the additional restriction site NheI downstream of the NgoMIV site and EcoRV and BsrGI cloning sites upstream of the DraIII site the 3′ reverse primer. All of the primers were designed to include additional 6-8 base pairs overhang for optimal restriction digest. Specifically, the sequence for the 5′ forward primer (“pMaxVax primer 1”) is acacatagcgccggcgctagctgagcaaaaggccagcaaaaggcca (SEQ ID NO:331) and the sequence for the 3′ reverse primer (“pMaxVax primer 2”) aactctgtgagacaacagtcataaatgtacagatatcagaccaagtttactcatatatac (SEQ ID NO:332).

Typically, the ColE1 PCR reactions were performed with proof-reading polymerases, such as Tth (PE Applied Biosystems), Pfu, PfuTurbo and Herculase (Stratagene), or Pwo (Roche), under conditions in accordance with the manufacturer's recommendations. By way of illustration, a typical Herculase polymerase PCR reaction contains 1 μl template plasmid DNA (1-10 ng/μl), 5 μl 10× buffer, 1 μl dNTPs (deoxynucleotide triphosphates) at 10 mM each, 1 μl forward primer (20 μM), 1 μl reverse primer (20 μM), 40 μl deionized, sterile water and 0.5 μl Herculase polymerase in a 50 μl reaction. Such PCR reactions were performed at 94° C. for 30 seconds, 55° C. for 30 seconds, and 72° C. for 30 seconds per cycle, for a total of 25 cycles.

The ColE1 PCR product was purified with phenol/chloroform using Phase lock Gel™Tube (Eppendorf) followed by standard ethanol precipitation. The purified ColE1 PCR product was digested with the restriction enzymes NgoMIV and DraIII according to the manufacturer's recommendations (New England Biolabs, Inc.) and gel purified using the QiaExII gel extraction kit (Qiagen) according to the manufacturer's instructions.

In this embodiment, the Kanamycin resistance gene (transposon Tn903) was isolated from plasmid pACYC177 (New England-Biolabs, Inc.) using standard PCR techniques. Specifically, a 5′ PCR primer (“pMaxVax primer 3”), ggcttctcacagagtggcgcgccgtgtctcaaaatctct (SEQ ID NO:333), comprising sequences homologous to the 5′ kanamycin gene and an additional DraIII site upstream of an AscI site, and a 3′ primer (“pMaxVax primer 4”), ttgctcagctagcgccggcgccgtcccgtcaagtcagcgt (SEQ ID NO:334), comprising sequences homologous to the 3′ kanamycin gene and a NgoMIV cloning site, were used to amplify the Kana^(r) gene from pACYC177. The PCR reactions, product purification and digest with DraIII and NgoMIV were performed as described above. About 20 ng of each of the Kana^(r) PCR product and ColE1 PCR product were obtained and ligated in a 20 μl reaction, containing 2 μl 10× buffer and 1 U ligase (Roche). Amplification in E. coli was performed using standard procedures as described in Sambrook, supra. Plasmids were purified with the QiaPrep-spin Miniprep kit (Qiagen) following the manufacturer's instructions and digested with BsrG1 and DraIII for subsequent ligation of the mammalian transcription unit (promoter and polyA).

In one embodiment, the pMaxVax10.1 vector comprise a CMV immediate early enhancer promoter (CMV IE), which was isolated from DNA of the CMV virus, Towne strain, by standard PCR methods. The cloning sites EcoRI and BamHI were incorporated into the PCR forward and reverse primers. The EcoRI and BamHI digested CMV IE PCR fragment was cloned into pUC19 for amplification. The CMV promoter was isolated from the amplified pUC19 plasmid by restriction digest with BamHI and BsrGI. The BsrGI site is located 168 bp downstream of the 5′ end of the CMV promoter, resulting in a 1596 bp fragment, which was isolated by standard gel purification techniques for subsequent ligation.

In one embodiment, a polyadenylation signal from the bovine growth hormone (BGH) gene was used. Other polyadenylation signals (e.g., SV40 poly A sequences) may also be employed. In this instance, a BGH nucleotide sequence was isolated from the pcDNA3.1 vector (Invitrogen) by standard PCR techniques. Briefly, a 5′ PCR forward primer (“pMaxVax primer 5”), agatctgtttaaaccgctgatcagcctcgactgtgccttc (SEQ ID NO:335), which includes recognition sites for the restriction enzymes PmeI and BglII to form part of the p.MaxVax10.1 vector polylinker, and a 3′ reverse primer (“pMaxVax primer 6”), acctctaaccactctgtgagaagccatagagcccaccgca (SEQ ID NO:336), which includes a DraIII site for cloning to the minimal plasmid Col/Kana, were prepared by standard techniques and used to amplify a BGH polyA PCR product. The BGH polyA PCR product was diluted 1:100. 1 μl of the diluted BGH polyA PCR product was used as a template for a second PCR amplification using the same 3′ reverse primer and a second 5′ primer (“pMaxVax primer 7”), ggatccggtacctctagagaattcggcggccgcagatctgtttaaaccgctga (SEQ ID NO:337), which overlapped the 5′ end of the template by 20 bp, and contained another 40 bp 5′ sequence comprising BamHI, KpnI, XbaI, EcoRI, and NotI restriction sites for inclusion of these sites in the p.MaxVax10.1 vector polylinker.

The final ligation reaction to form pMaxVax10.1 was performed with about 20 ng each of the BsrG1 and BamHI digested CMV IE PCR product, BamHI and DraIII digested polylinker and BGH poly A PCR product, and the DraIII and BsrG1 digested minimal plasmid Col/Kana in a 50 μl reaction with 5 μl 10× ligase buffer and 2 U ligase (Roche). Ligation, amplification and plasmid purification were performed as described above. The plasmid was transfected into E. coli using standard techniques for cloning.

C. Construction of pMaxVax10.1Dengue Virus Antigen Expression Vector

In one aspect, BamHI and EcoRI digested and gel-purified Den-1PRM15/tE CO, Den-2PRM15/tE CO, Den-3PRM15/tE CO, and Den-4PRM15/tE CO nucleic acids (described above) were each cloned into the pMaxVax10.1 vector by digesting the vector with BamHI and EcoRI using standard techniques, gel purifying the linearized vector, and ligating the respective dengue antigen-encoding sequences (separately in 4 different ligation reactions to the vector to form 4 plasmid vector constructs, each comprising a codon optimized nucleic acid encoding Den-1PRM15/tE, Den-2PRM15/tE, Den-3PRM15/tE, or Den-4PRM15/tE fusion protein antigen (see FIG. 2). Such vectors are typically termed pMaxVax10.1_(Den-1PRM15/tE CO), pMaxVax10.1_(Den-2PRM15/tE CO), pMaxVax10.1_(Den-3PRM15/tE CO), and pMaxVax10.1_(Den-4PRM15/tE CO), respectively. FIG. 2 shows an exemplary pMaxVax10.1 expression vector comprising a nucleotide sequence corresponding to Den-2PRM15/tE CO. These dengue antigen fusion protein-encoding expression vectors were propagated in E. coli using standard techniques to obtain a population of plasmids for transfection experiments.

Similar procedures are used to construct pMaxVax10.1 expression vectors comprising each of the nucleotide sequences described above: Den-1 C15/full prM/full E CO (SEQ ID NO:231), Den-2 C15/full prM/full E CO (SEQ ID NO:232), Den-3 C15/full prM/full E CO (SEQ ID NO:233), and Den-4 C15/full prM/full E CO (SEQ ID NO:234).

D. Dengue Antigen Expression and Secretion Analysis for Den-1PRM15/tE CO, Den-2PRM15/tE CO, Den-3PRM15/tE CO, and Den-4PRM15/tE CO

Dengue antigens encoded by wild-type dengue nucleic acids (or portions or fragments thereof), are typically poorly expressed and secreted. To assess the expression and/or secretion of dengue antigens expressed from the above-described pMaxVax10.1 expression vectors the following techniques were performed.

Populations of human 293-HEK cells were grown in tissue culture under standard conditions, and each population was transfected with a pMaxVax10.1 expression vector comprising one of the nucleic acid sequences described above: Den-1PRM15/tE CO, Den-2PRM15/tE CO, Den-3PRM15/tE CO, or Den-4PRM15/tE CO. Transfections were carried out using commercially available transfections reagents, typically Effectene (Qiagen) and FuGene (Boehringer/Roche), in accordance with the manufacturer's instructions (e.g., with respect to amount of plasmid used for transfection). Each population of transfected 293 cells was incubated for about 48-72 hours under conditions permissive for transgene expression.

Two sets of samples of each of the transfected 293 cell cultures were prepared. The first set of samples was prepared by harvesting the supernatants of each population of cells transfected with one of the 4 expression plasmids and concentrating the volume 10-20 fold by standard methods (e.g., centrifugation using Ultrafree-4 centrifugal filter, Millipore, Mass.). The protein secretion levels were determined by Western blot analysis. In the other set of samples, cell of each population of cells transfected with one of the 4 expression plasmids were harvested, subjected to cell lysis, and the resulting lysate collected and subjected to Western blot analysis to determine expression levels.

For Western blot analysis, the proteins from the cell lysates and cell supernatants, respectively, were separated by polyacrylamide gel electrophoresis and blotted to nitrocellulose membranes using the technique described by the manufacturers (NuPage™ and Invitrogen, respectively). Cell lysates were run in one lane (marked “L”) and cell supernatants were run in another lane (marked “SN”) on a single polyacrylamide gel for each nucleic acid tested (e.g., p.MaxVax10.1_(Den-2PRM15/tE CO) transfected cell lysates and supernatants were run on a single gel).

Polyclonal anti-DEN-1 mouse antisera and anti-DEN-3 mouse antisera were obtained from ATCC, polyclonal anti-DEN-4 mouse antisera were received from the WHO antibody collection, and a monoclonal anti-DEN-2 antibody was received from United States Biological (Swampscott Mass.—USA) (polyclonal anti-DEN-2 mouse antisera repeatedly failed to bind to denatured-wild-type DEN-2 antigens on filters and, accordingly, were not used).

The filters were incubated with dengue virus serotype specific antibodies corresponding to the type of dengue antigen encoded by the nucleic acid used to transfect the samples (i.e., anti-DEN-2 mAb was used for filters blotted with the lysates or supernatant from pMaxVax10.1_(Den-2PRM15/tE CO) transfected cells). The antibody-antigen incubations were performed for 1 hour at room temperature, the filters were washed 5 times for 25 minutes with PBS Buffer and 0.1% Tween 20, and the filters were further incubated with a secondary enzyme-conjugated (either a horse radish peroxidase (HRP)-conjugated or alkaline phosphatase-conjugated) anti-mouse antibody. After a 1-hour incubation at room temperature, the filters were washed and incubated with the enzyme substrates for colorimetric detection. The Western blot obtained by this experiment is shown in FIG. 3.

The Western blot shown in FIG. 3 demonstrates that antigens encoded by Den-1PRM15/tE CO, Den-2PRM15/tE CO, Den-3PRM15/tE CO, and Den-4PRM15/tE CO were well expressed in transfected 293 cells. However, only antigens expressed from pMaxVax10.1 vectors comprising Den-2PRM15/tE CO, Den-3PRM15/tE CO, and Den-4PRM15/tE CO appear to be secreted from transfected cells. In comparison, truncated envelope antigens expressed from wild-type dengue virus nucleotide sequences have been shown to be poorly expressed and very poorly secreted in mammalian cells.

These experiments demonstrate that nucleic acid sequences comprising Den-1PRM15/tE CO, Den-2PRM15/tE CO, Den-3PRM15/tE CO, and Den-4PRM15/tE CO sequences are capable of high levels of dengue antigen expression in transfected cells. Moreover, the results of these experiments demonstrate that select nucleic acid sequences of the invention can be used to effectively express dengue antigens as secreted proteins.

Similar methods are used to assess the expression and/or secretion of pMaxVax10.1 expression vectors comprising each of the following nucleotide sequences described above: Den-1 C15/full prM/full E CO (SEQ ID NO:231), Den-2 C15/full prM/full E CO (SEQ ID NO:232), Den-3 C15/full prM/full E CO (SEQ ID NO:233), and Den-4 C15/full prM/full E CO (SEQ ID NO:234).

Example 2

This example illustrates exemplary methods for generating libraries of nucleic acids encoding recombinant dengue antigens and screening or selecting from such libraries to identify recombinant nucleic acids that encode recombinant dengue antigens in mammalian cells.

Libraries of recombinant nucleic acid sequences were generated by recursive sequence recombination procedures using purified Den-1PRM15/tE CO, Den-2PRM15/tE CO, Den-3PRM15/tE CO, and Den-4PRM15/tE CO, e.g., as parental nucleotide sequences. The recursive sequence recombination procedures were conducted as described previously in, e.g., Stemmer (1994) Nature 370:389-391 and Crameri, A. et al. (1998) Nature 391:288-91, U.S. Pat. No. 5,837,458, and other references cited above in the section describing recursive sequence recombination, each incorporated herein by reference in its entirety for all purposes.

Briefly, each pMaxVax10.1 expression vector comprising a Den-1 PRM15/tE CO, Den-2 PRM15/tE CO, Den-3 PRM15/tE CO, or Den-4 PRM15/tE CO nucleotide sequence was cleaved by restriction enzymes flanking the dengue PRM15/tE nucleotide sequence, and each resulting nucleotide fragment were isolated by standard gel purification and subjected to recursive sequence recombination. The recombined nucleic acid products were then amplified by PCR with rescue primers located upstream of the 5′ BamHI and 3′ of the EcoRI cloning sites, digested with BamHI and EcoRI, and gel purified using standard techniques. The isolated recombinant nucleic acids were ligated into pMaxVax10.1 vectors (e.g., as described in Example 1 to form a recombinant library of recombinant DNA plasmid vectors, which were subsequently cloned in E. coli using standard library transfection techniques (see, e.g., Sambrook, supra) and according to manufacturer's instructions.

The transformed cells were plated overnight, the individual colonies were pooled, and plasmid DNA was prepared by standard purification methods (Qiagen). Plasmid DNA of the pMaxVax10.1 expression libraries was transfected into HEK-293 cells in culture using Superfect reagent (Qiagen) according to the manufacturer's instructions. Several libraries so produced were then analyzed for dengue antigen expression by flow cytometry (Fluorescence-Activated Cell Sorter—FACS analysis) with anti-dengue virus antibodies from mouse ascitic fluid.

Specifically, to perform FACS analysis, about 1×10⁵ transfected 293 cells were incubated with a mixture of the mouse anti-dengue virus antibodies (as described above), dissolved in PBS buffer containing 2% fetal calf serum (FCS). The optimal antibody dilution was evaluated on a case-by-case basis for each antibody. In general, serial test dilutions of 1:500, 1:1000, and 1:2000 were used. The cells were stained for 30 minutes on ice and washed 3 times with PBS buffer before being incubated with appropriate secondary antibodies, which were coupled with a fluorescent detection reagent (goat anti-mouse phycoerythrin conjugate, CalTag Lab). The staining concentration was determined for each labeled antibody to provide a maximal Mean Fluorescence Intensity (MFI) and minimal background signal (e.g., optimum staining concentration was the concentration per 10⁵ cells). After 30 minutes incubation on ice, the cells were washed 3 times with PBS and analyzed by FACS. Specifically, cells were analyzed using a FACSCalibur flow cytometer and CellQuest software (BDIS, San Jose, Calif.).

First round libraries of recombinant nucleic acids were prepared according to this protocol. Recombinant nucleic acids encoding dengue virus PRM15/tE fusion proteins cloned into pMaxVax vectors, wherein such nucleic acid library-transfected cells were incubated with a human dengue virus antisera, were transfected into human 293 cell cultures and analyzed by standard FACS analyses, as described above, using a program that provides a graphical output of the number of positive cells against fluorescence intensity (software settings and dyes were selected such that dead cells were excluded from the output signal as were aggregated cell masses). The recombinant nucleic acids were produced by recursively recombining the following codon optimized dengue virus antigen parental nucleotide sequences with one another: Den-1PRM15/tE CO, Den-2PRM15/tE CO, Den-3PRM15/tE CO, and Den-4PRM15/tE CO. Each library of recombinant PRM15/tE-encoding dengue virus nucleic acids comprised at least one recombinant nucleic acid that encoded a recombinant PRM15/tE dengue virus polypeptide that was bound by one or more murine anti-dengue virus antibodies. The experiments also included a population of 293 cells transfected with a pMaxVax10.1_(null) vector (FIG. 1) lacking any dengue virus nucleic acid insertion (termed a “null vector”) as a control (C) (data not shown).

Similar experiments were repeated with dengue virus antisera obtained from human patients (Immunology Consultants Laboratory Inc.—Sherwood, Oreg.) infected with dengue viruses (of unknown virus serotype(s)) (data no shown). Each library of recombinant PRM15/tE dengue virus nucleic acids also comprised at least one recombinant dengue virus nucleic acid that encoded a PRM15/tE dengue virus polypeptide that was bound by one or more human anti-dengue virus antisera.

To isolate such individual recombinant nucleic acids, individual E. coli colonies were picked from the plated libraries and inoculated into 96-well blocks containing 1.2 ml Terrific Broth-amp (50 μg/ml). The 96-well plate cultures were grown for 20 hours at 37° C., and plasmid DNA was purified using the Biorobot (Qiagen, Valencia, Calif.). HEK-293 cells were plated in 96-well plates at a density of 2×10⁴ cells per well the day prior to transfections. The cells were transfected with individual recombinant pMaxVax10.1 vectors, each comprising a recombinant dengue virus nucleic acid, and a pMaxVax10.1_(null) “null vector” as a control (which lacked a nucleic acid encoding a recombinant or WT dengue virus PRM15/tE), using a Superfect (Qiagen) transfection system according to the manufacturer's instructions. After about 48 hours incubation under conditions permissive for transgene expression, the transfected 293 cells were harvested or lysed using standard techniques, depending on the type of analysis to be performed on the cell or aspirated cell-free cell medium (supernatant) (e.g., whether the cells were subjected to FACS or Western blot analysis, as described further below).

Such techniques were used to screen or select from the libraries of recombinant polynucleotides produced by the above-described method to identify cells that comprised nucleic acids encoding polypeptides that reacted with specific dengue virus antibodies. Briefly, each well was analyzed with anti-dengue virus antibodies from mouse ascitic fluid against all four virus serotypes by FACS, as described above. Positive cells were counted in the FL2-H channel and graphically plotted. The graphical output obtained from these experiments was compared to the output obtained from similar experiments performed with the vector control (null vector). Cells that exhibited an intensity of at least about 10² were considered positive for recombinant PRM15/tE dengue virus Ag expression (data not shown).

Plasmid DNAs of pMaxVax10.1 DNA vectors corresponding to positive clones identified in such 96-well high throughput screening assays were again transfected into 293 cells, and these cells were harvested and analyzed for tetravalent antigen expression against with the four serotype specific antibodies, as described below (Example 3).

The results of these experiments demonstrate that recursive sequence recombination and appropriately devised selection/screening procedures can be applied to DEN-1, DEN-2, DEN-3, or DEN-4 PRM15/tE CO (codon-optimized) antigen-encoding nucleic acids to generate libraries of novel recombinant nucleic acids and identify therefrom specific recombinant nucleic acids comprising nucleotide sequences that encode recombinant dengue antigens that reacted with murine and/or human dengue virus antibodies.

Example 3

This example describes the production and identification of recombinant nucleic acids that encode multivalent dengue antigens using recursive sequence recombination methods and chosen selection/screening procedures.

Clone 2/7 was selected from a library of clones comprising recombinant nucleic acids produced according to Example 2, after being identified as positive for the expression of a recombinant dengue antigen by FACS analysis. The nucleic acid sequence of clone 2/7, comprising a recombinant PRM15/trunE sequence is shown in SEQ ID NO:156. (The nucleic acid sequence of clone 2/7 comprising only the recombinant truncE nucleotide segment is shown in SEQ ID NO:285.)

293 cells were transfected with pMaxVax10.1_(2/7) or pMaxVax10.1_(Den-3PRM15/tE CO) using standard techniques and according to manufacturer's instructions, as described above. The transfected cells were cultured for about 48 hours under conditions permissive for nucleotide (e.g., transgene) expression, harvested, and divided into four for separate staining reactions. Serotype specific mouse DEN-1, DEN-2, DEN-3, and DEN-4 polyclonal antisera were then added to pMaxVax10.1_(2/7) or pMaxVax10.1_(Den-3PRM15/tE CO) transfected cells, followed by an incubation with appropriately labeled secondary antibodies, and the cells subjected to FACS analyses, as described in Example 2. The results of these experiments, which are shown in FIG. 4, demonstrate that the cloned recombinant nucleic acid of E. coli clone 2/7 (clone 2/7—SEQ ID NO:156), which encodes a recombinant antigen (SEQ ID NO:65) that is expressed on the surface of mammalian cells, is reactive with antibodies of all 4 dengue virus serotypes (i.e., tetravalent).

In comparison, Den-3PRM15/tE CO expressed an antigen having a WT DEN-3PRM15/truncated E fusion protein sequence (SEQ ID NO:151) that was reactive only with antibodies against DEN-3 and also cross-reactive with anti-DEN-1 antibodies.

These experimental results demonstrate the production of cell surface recombinant antigens that are cross-reactive with antibodies to all 4 serotypes of dengue viruses by one of the inventive methods of the invention. Moreover, the experiment illustrates an effective technique for identifying recombinant nucleic acids encoding multivalent antigens from a recombinant nucleic acid library produced according to the methods described herein.

Example 4

This example illustrates the identification of recombinant multivalent dengue antigens produced according to the methods described herein by Western blot analyses.

Six representative “PMR15/tE” clones (designated 2/7, 5/21, 2G11, 6E12, 6A11, and 6D8) identified as positive for the expression of recombinant dengue antigens by the methods described in Example 2 were selected. The recombinant nucleic acid-containing pMaxVax10.1 vectors corresponding to each of these 6 clones were isolated, as described above. Eleven cultures of 293 cells were prepared. Six cultures were transfected individually with one of the six pMaxVax10.1 vectors comprising a recombinant dengue PRM15/tE nucleotide sequence. Each of the remaining five cultures was transfected individually with one of the following vectors comprising a PRM15/tE CO parental nucleotide sequence—pMaxVax10.1_(Den-1PRM15/tE CO), pMaxVax10.1_(Den-2PRM15/tE CO), pMaxVax10.1_(Den-3PRM15/tE CO), pMaxVax10.1_(Den-4PRM15/tE CO) (collectively referred to as “parental sequences” or “parent sequences”), pMaxVax10.1_(null) (vector control; no antigen-encoding sequence added)—under identical conditions. All transfections were performed using standard techniques and in accordance with manufacturers' instructions.

After about 48 hours incubation, under conditions permissive for nucleotide (e.g., transgene) expression, the transfected 293 cells were harvested, lysed and then separately subjected to polyacrylamide gel electrophoresis and Western blot analysis using mouse DEN-1, DEN-3, and DEN-4 antisera (designated α-DEN-1, α-DEN-3, and α-DEN-4, respectively) and appropriately labeled secondary antibodies, using the techniques described in Example 1. The results of these experiments are shown in FIG. 5.

FIG. 5 shows that each of the selected recombinant nucleic acids encoded a secreted recombinant antigen that reacted with antibodies to DEN-1, DEN-3, and DEN-4. At best, only some of the parental nucleotide sequence-encoded antigens showed minimal cross-reactivity with antibodies other than those with which they are normally associated (see, e.g., the reactivity of Den-1PRM15/tE CO-encoded antigen with DEN-3 antibodies), resulting in a Western blot marked by light bands compared to the consistently well-defined bands observed with the six recombinant dengue antigens. Sequencing analysis determined that 2/7 and 6A11 were encoded by identical nucleic acid sequences, as were 5/21 and 6D8 explaining the remarkably similar banding patterns observed for these antigens.

The results of these experiments demonstrate an inventive method for producing and identifying (e.g., screening/selecting) recombinant secreted dengue antigens (comprising recombinant PRM15/tE fusion proteins or related recombinant truncated E proteins lacking the PRM15 sequence) that cross-react with (or bind or specifically bind to) anti-dengue virus antigen antibodies against multiple (e.g., at least two, at least three) dengue virus serotypes in mammalian cells.

Analogous procedures are used to produce and identify recombinant secreted dengue antigens comprising, e.g., recombinant C15/full length prM/full length E fusion proteins (or related recombinant full length prM/full length E fusion proteins lacking the C15 sequence and the initial Met residue) that cross-react with (or bind or specifically bind to) anti-dengue virus antigen antibodies against at multiple dengue virus serotypes in subjects.

Example 5

This example describes methods to produce and identify (e.g., screening/selecting) recombinant dengue antigens that induce the production of antibodies to dengue viruses of multiple virus serotypes in vivo.

Nucleic acid libraries comprising recombinant polynucleotide sequences were produced using recursive sequence recombination techniques as described, e.g., in Example 2. pMaxVax10.1 plasmid vectors comprising members of the library of recombinant nucleic acid sequences (which encode PRM15/trunE dengue virus antigen fusion proteins) were constructed. In one aspect, at least sixteen recombinant antigens corresponding to 16 clones were identified as positive for the expression of tetravalent dengue antigens (e.g., dengue antigens reactive with all antibodies against all four serotypes) by FACS analysis described in Example 2. The pMaxVax10.1 plasmid vectors used for transformation of the clones were isolated and purified using the techniques described above for DNA immunization experiments, and then used for immunizations in mice as follows.

Inbred mice (BALB/c) were individually injected with endotoxin free purified (Qiagen) pMaxVax10.1 vector DNA comprising one of the sixteen recombinant nucleic acids or Den-3PRM15/tE CO. Three mice were injected with 50 μg of plasmid DNA of one of the indicated plasmid types in each leg muscle. Three mice received an identical dose of a control vector, pMaxVax10.1_(null). All of the immunized mice received booster immunizations of identical dosage as the initial DNA immunization at day 14 after the initial immunization.

40 μl of serum was collected from each immunized mouse 28 days after the initial plasmid DNA injection and analyzed for antibody induction in ELISA assays. ELISA plates (Nunc Immuno Maxisorp (Roskilde—Denmark)) were coated overnight at 4° C. with the test antigens (inactivated dengue viruses of each serotype—Den-1, Den-2, Den-3, and Den-4-supplied by Immunology Consultants Laboratory, Inc.—Sherwood, Oreg.) using standard techniques and according to the manufacturer's instructions. The plates were washed 3 times with PBS buffer containing 0.1% Tween 20 and blocked with 3% BSA/PBS/0.1% Tween20 for 1 hour at 37° C. to reduce unspecific binding. The plates were washed 3 times with PBS/0.1% Tween 20 and incubated for 1 hour at 37° C. with the anti-dengue test sera in a 1:100 dilution and after additional 3 washing steps incubated for 1 hour at 37° C. with the secondary antibodies (goat anti-mouse HRP conjugates, Amersham) at a 1:3000 dilution. The plates were finally washed 5 times with PBS/0.1% Tween 20 and incubated with TMB peroxidase substrates (Tetramethyl Benzidine, Pierce). The color reaction was stopped with 2M H₂SO₄, and the optical density (absorbance) for each sample was analyzed spectrophotometrically at 450 nanometers (nm) on an ELISA plate reader. Alternatively, ELISA assays can be performed by using other standard assay formats, including, e.g., that described in Raviprakash et al., J. Gen. Virology 81:1659-1667 (2000), which is incorporated herein by reference in its entirety for all purposes).

A least seven of the sixteen recombinant PRM15/tE dengue virus antigen-encoding nucleic acids tested—2G11 (SEQ ID NO:157), 2/7 (also termed “6A11”) (SEQ ID NO:156), 6E12 (SEQ ID NO:159), 6C6 (SEQ ID NO:160), 5/21 (also termed “6D8”) (SEQ ID NO:158), 6F4 (SEQ ID NO:161), and 7A9 (SEQ ID NO:162), were identified as encoding seven respective recombinant antigens (i.e., 2G11 (SEQ ID NO:66), 2/7 (6A11) (SEQ ID NO:65), 6E12 (SEQ ID NO:69), 6C6 (SEQ ID NO:68), 5/21 (6D8) (SEQ ID NO:67), 6F4 (SEQ ID NO:70), and 7A9 (SEQ ID NO:71) that, upon expression in vivo, produced antibodies against DEN-1, DEN-2, DEN-3, and DEN-4 that were detected in a standard ELISA assay. The signal peptide sequence is typically cleaved after transport of the tE protein into the ER.

The average optical density (OD) values for each of these seven tetravalent antigens were calculated for each serotype-specific ELISA plate tested. These values were plotted on a graph along with the average OD values observed for mice injected with pMaxVax10.1_(Den-3PRM15/tE CO) and pMaxVax10.1_(null) on each of the dengue virus serotype-specific ELISA plates (data not shown). Representative data for four of these recombinant tetravalent antigens, 2G11, 2/7, 6E12, 5/21) are shown in FIG. 6 (Example 6). In this figure, pMV refers to pMaxVax10.1_(null).

In vivo injection of each mouse with a population of pMaxVax10.1 DNA plasmid expression vectors, wherein each said population of vectors comprised one such recombinant dengue antigen-encoding nucleotide sequence, resulted in the production of antibodies that reacted with multiple serotype-specific dengue antigens in the standard ELISA. For example, antisera that were obtained from immunized mice, wherein each such mouse had been injected with a population of pMaxVax10.1 vectors, each of said population of vectors comprising one of the following recombinant DNA sequences—2G11 DNA, 2/7 (6A11) DNA, 5/21 (6D8) DNA, or 6E12 DNA, produced higher OD levels than antisera that were obtained from mice injected with a population of pMaxVax10.1_(Den-1PRM15/tE CO) expression vectors, and tested on ELISA plates, coated with inactivated DEN-1 virus. Significantly, sera obtained from mice immunized with any of the seven of these plasmids, each comprising one of the seven recombinant antigen-encoding DNA sequences, exhibited higher OD levels than sera obtained from mice immunized with pMaxVax10.1_(Den-2PRM15/tE) when analyzed on inactivated DEN-2 virus coated ELISA plates. Moreover, OD levels for sera obtained from mice immunized with pMaxVax10.1_(2/7) DNA, pMaxVax10.1_(5/21) DNA, pMaxVax10.1_(2G11) DNA, pMaxVax10.1_(6E12) DNA, were also higher than those observed for sera obtained from mice immunized with pMaxVax10.1_(Den-3PRM15/tE CO) and assayed on inactivated DEN-3 virus coated ELISA plates. OD levels for the plasmids comprising the shuffled DNA-encoded antigens pMaxVax10.1_(2G11) DNA and pMaxVax10.1_(6E12) DNA were at least comparable to those of mice injected with pMaxVax10.1_(Den-4PRM15/tE) when assayed on inactivated DEN-4 virus coated ELISA plates.

The results of this experiment demonstrate the effectiveness of recursive sequence recombination and appropriate screening/selection assays in generating and identifying nucleic acids encoding recombinant (PRM15/tE) dengue antigens that upon expression in vivo induce or promote the production of antibodies that react with (or bind or specifically bind to) dengue virus antigens of multiple dengue virus serotypes. The results also demonstrate that recombinant nucleic acids of the invention (and recombinant antigens encoded therefrom) are useful and effective in inducing or promoting the production of antibodies that react with (or bind or specifically bind to) dengue virus antigens of multiple dengue virus serotypes in vivo in subjects, including mammals.

Example 6

This example illustrates the generation of a library of recombinant nucleic acids by a second round of recursive sequence recombination, and the identification and/or isolation of recombinant nucleic acids encoding recombinant dengue antigens from the library which, when expressed in vivo, induce or enhance the production of antibodies to dengue viruses of multiple dengue virus serotypes.

The seven nucleic acids identified as encoding tetravalent antigens in Example 5 (PRM15/trunE format) were isolated, purified, and subjected to a second round of recursive sequence recombination (e.g., DNA shuffling and appropriately defined selection/screening) to produce a first second round library of recombinant nucleic acids (PRM15/trunE format), all of which were cloned in E. coli.

In one exemplary analysis, at least twenty-one recombinant nucleic acids (PRM15/trunE format) in a resulting library were identified as encoding recombinant polypeptides that reacted with dengue virus antibodies of at least three dengue virus serotypes by testing for antibody-antigen binding using the serotype specific anti-Dengue virus-types 1-4 antisera in FACS analyses performed under similar conditions as related experiments described in Examples 1-3. The twenty-one nucleic acid sequences were isolated, purified, and ligated into pMaxVax10.1 vectors, in accordance with the techniques applied in Example 1 (see, e.g., FIG. 2). The recombinant pMaxVax10.1 plasmids were cloned in E. coli to generate plasmids for the following DNA immunization experiments.

A group of three mice was injected with 100 μg of plasmid DNA for each one of the identified 21 pMaxVax10.1 plasmid vectors, each such vector comprising a shuffled DNA sequence, or with the control vector, pMaxVax10.1_(null). Each mouse received a booster immunization of the same dose of the same vector as the initial immunization at day 14, and was subsequently bled on day 28 to obtain sera, according to the method described in Example 5. Sera obtained from the immunized mice were analyzed in a 1:100 dilution in PBS for antibody induction in ELISA assays on DEN-1, DEN-2, DEN-3, and DEN-4 inactivated virus coated ELISA plates, respectively, as described in Example 5. At least twelve of the nucleic acid sequences (PRM15/truncE format), 11B1 DNA (SEQ ID NO:173), 11B8 DNA (SEQ ID NO:174), 11C11 DNA (SEQ ID NO:176), 11E2 DNA (SEQ ID NO:163), 12E3 DNA (SEQ ID NO:164), 12H4 DNA (SEQ ID NO:177), 13E2 DNA (SEQ ID NO:165), 13E11 DNA (SEQ ID NO:167), 13F11 DNA (SEQ ID NO:178), 14B1 DNA (SEQ ID NO:179), 14E9 DNA (SEQ ID NO:166), and 14H2 DNA (SEQ ID NO:181), were identified by ELISA as encoding recombinant antigens, 11B1 (SEQ ID NO:72), 11B8 (SEQ ID NO:73), 11C11 (SEQ ID NO:75), 11E2 (SEQ ID NO:76), 12E3 (SEQ ID NO:77), 12H4 (SEQ ID NO:78), 13E2 (SEQ ID NO:79), 13E11 (SEQ ID NO:80), 13F11 (SEQ ID NO:81), 14B1 (SEQ ID NO:82), 14E9 (SEQ ID NO:83), and 14H2 (SEQ ID NO:85), that induced in vivo production of antibodies that reacted with dengue virus antigens of the four virus serotypes—Den-1, Den-2, Den-3, and Den-4—at levels well above those observed with the control vector.

In a further round of experiments, mice were individually injected with 100 μg of plasmid DNA of one of the following: (1) ten representative plasmids (each comprising one of the 11B1, 11B8, 11C11, 11E2, 12E3, 12H4, 13E2, 14B1, 14E9, and 14H2 nucleotide sequences), (2) plasmids comprising four parental nucleotide sequences described in Example 1 (i.e., pMaxVax10.1_(Den-1PRM15/tE CO), pMaxVax10.1_(Den-2PRM15/tE CO), pMaxVax10.1_(Den-3PRM15/tE CO), and pMaxVax10.1_(Den-4PRM15/tE CO)), (3) plasmids comprising four select nucleic acids (2G11 DNA (SEQ ID NO:157), 6E12 DNA (SEQ ID NO:159), 5/21 DNA (SEQ ID NO:158), and 2/7 DNA (SEQ ID NO:156)), or (4) the pMaxVax10.1_(null) plasmid vector. The immunization experiments were performed in triplicate with booster immunizations and bleeding performed as described in Example 5. Sera obtained from the immunized mice were analyzed for antibody induction in ELISA assays on DEN-1, DEN-2, DEN-3, and DEN-4 inactivated virus coated ELISA plates, respectively, as described in Example 5. Average OD values for sera obtained from groups of mice, each group comprised 3 mice, which had been immunized with one type of plasmid against each type of ELISA plate were determined and graphically plotted. The results of these calculations are shown in FIG. 6. In this figure, the label “pMV” on the X axis of each plot refers to pMaxVax10.1_(null), and the labels “D-1,” “D-2,” “Den-3,” and “Den-4E” on the X axes refer to pMaxVax10.1_(Den-1PRM15/tE CO), pMaxVax10.1_(Den-2PRM15/tE CO), pMaxVax10.1_(Den-3PRM15/tE CO), and pMaxVax10.1_(Den-4PRM15/tE CO), respectively.

The results of these experiments, as shown in FIG. 6, demonstrate that all of the selected nucleic acids expressed antigens that induced antibodies in mice that strongly reacted with all four inactivated WT dengue viruses (Den-1, Den-2, Den-3, and Den-4) in ELISA assays. Sera obtained from mice injected with plasmids comprising the selected second round library nucleic acid sequences exhibited higher average OD values on DEN-1, DEN-2, and DEN-3 ELISA plates than did the most related parental sequence; for example, sera from mice immunized with the second round recombinant nucleic acids had higher average OD values on the DEN-1 plate than the average OD of mouse sera obtained from mice immunized with pMaxVax10.1_(Den-1PRM15/tE CO) (labeled as D-1 in FIG. 6). Furthermore, sera from mice that received injections of pMaxVax10.1 plasmid DNA comprising at least 4 of the first second round nucleic acids—11C1-encoding DNA, 11B1 DNA, 11B8 DNA, and 14B1 DNA—as well sera obtained from mice that received injections of at least pMaxVax10.1_(6E12) DNA, exhibited OD levels comparable to those levels observed with sera from mice that received injections of pMaxVax10.1_(Den-4PRM15/tE CO) analyzed on DEN-4 specific ELISA plates.

The results of these experiments further demonstrate generation and identification of recombinant nucleic acids encoding recombinant (PRM15/tE) dengue antigens that induce or enhance production of antibodies that react with (or bind or specifically bind to) multiple dengue virus serotypes in vivo (as identified by the selecting screening methods described herein). The results of these experiments also confirm that the nucleic acids and plasmid vectors of the invention (and the resulting recombinant polypeptides encoded therefrom) are capable of and useful for inducing and enhancing such an immune response(s) in vivo in subjects, including mammals.

Example 7

This example illustrates the complex chimerism (i.e., sequence diversity) of select recombinant antigens of the invention where the amino acid sequences of such recombinant antigens are compared to amino acid sequences of corresponding WT dengue virus antigens.

The antigens corresponding to the recombinant nucleotide sequences of the seven clones identified and selected as examples in the first round recombinant nucleotide library in Example 5 (i.e., 2/7 (also termed “6A11”), 5/21 (also termed “6D8”), 2G11, 6E12, 6C6, 6F4, and 7A9) and the antigens corresponding to the recombinant nucleotide sequences of the 12 clones identified and selected in the first second round recombinant nucleotide library in Example 6 (11B1, 11B8, 11C11, 11E2, 12E3, 12H4, 13E2, 13E1, 13F11, 14B1, 14E9, and 14H2) were sequenced and compared with the polypeptide sequences of the corresponding sequence regions (i.e., the PRM15 and truncated E protein (e.g., about 90% of the N terminus of the E protein)) of each wild-type dengue antigen for each of the 4 dengue serotypes to determine regions of amino acid sequence identity between the recombinant and wild-type antigens using standard techniques. Through such analysis, it was determined that the recombinant antigens included, e.g., various amino acid regions, fragments, or segments from the wild-type PRM15/truncated envelope protein polypeptide sequence for each of the 4 WT dengue virus serotypes. Approximate amino acid regions, fragments or segments in a recombinant antigen corresponding to a region, fragment or segment of a wild-type antigen amino acid sequence were graphically plotted to assess chimerism of the recombinant antigens (data not shown) (see, however, the exemplary plot shown in FIG. 11). Additionally, it was noted that diversity increased with additional recursive sequence recombination, e.g., via DNA shuffling in combination with appropriate screening or selection procedures.

The above-described amino acid sequence analysis illustrates the complex chimeric nature (sequence diversity) of at least many of the recombinant antigens of the invention and nucleic acids encoding them. Moreover, the results of this sequence analysis establish that greater diversity (more complex chimerism) is induced in recombinant antigens encoded by DNAs produced and identified via multiple rounds of recursive sequence recombination combined with appropriate screening/selection procedures.

Example 8

This example describes the generation of a library of recombinant nucleic acids by recursive sequence recombination, the identification of select recombinant nucleic acids in such library using appropriate screening/selection procedures, and the isolation of selected nucleic acids encoding recombinant dengue antigens from the library, which, when expressed in vivo, induced or enhanced the production of antibodies that reacted with dengue viruses of multiple virus serotypes.

Representative recombinant dengue-antigen-encoding nucleic acid sequences from first round library clones 2G1, 6E12, 2/7 (6A11), and 5/21 (6A8), described in Example 5, were isolated and purified using standard techniques and, in combination with the 4 parental dengue sequences described in Example 1, used to produce a new library of recombinant nucleic acids by recursive sequence recombination in accordance with the techniques described and/or referenced in Example 2 and throughout the specification.

From this library of recombinant nucleic acids, twenty-five recombinant nucleotide sequences (in PRM15/tE format)) were selected via flow cytometry (FACS) for immunization of mice and ELISA analyses of the sera using four serotype-specific inactivated dengue virus coated ELISA plates following the techniques described and referenced in Example 5. All 25 recombinant nucleotide sequences were analyzed as such and found to induce or enhance production of cross-reactive antibodies against all 4 DEN serotypes as discussed in detail below.

Specifically, twenty-five recombinant DNA sequences (in PRM15/tE format)—(15C2 DNA (SEQ ID NO:182), 15D4 DNA (SEQ ID NO:183), 15H4 DNA (SEQ ID NO:184), 16B4 DNA (SEQ ID NO:185), 16E8 DNA (SEQ ID NO:168), 16E10 DNA (SEQ ID NO:169), 16F12 DNA (SEQ ID NO:186), 16G11 DNA (SEQ ID NO:187), 17A12 DNA (SEQ ID NO:188), 17D5 DNA (SEQ ID NO:189), 17D11 DNA (SEQ ID NO:190), 17F5 DNA (SEQ ID NO:191), 17F11 DNA (SEQ ID NO:192), 17G5 DNA (SEQ ID NO:193), 17H3 DNA (SEQ ID NO:194), 17H10 DNA (SEQ ID NO:195), 17H12 DNA (SEQ ID NO:196), 18A9 DNA (SEQ ID NO:197), 18B7 DNA (SEQ ID NO:198), 18D7 DNA (SEQ ID NO:199), 18E9 DNA (SEQ ID NO:170), 18E10 DNA (SEQ ID NO:171), 18E11 DNA (SEQ ID NO:172), 18H2 DNA (SEQ ID NO:200), and 18H6 DNA (SEQ ID NO:235))—were isolated from the new library of recombinant nucleic acids using techniques as described in Examples 1-7. A pMaxVax10.1 plasmid vector comprising each such recombinant DNA sequence was prepared as described above.

Mice were divided into groups of three. Three mice were each immunized by injection with one of the following DNA constructs: (1) 100 μg of pMaxVax10.1 plasmid DNA vector comprising one of the 25 recombinant sequences described above; (2) 100 μg of pMaxVax10.1 plasmid vector comprising the recombinant DNA sequence corresponding to clone 11C4 (SEQ ID NO:175) identified in a first second round library (see Example 5); (3) 100 μg of pMaxVax10.1 plasmid vector comprising the recombinant DNA sequence corresponding to clone 14G10 DNA (SEQ ID NO:180) identified in a first second round library (see Example 5), and (4) 100 μg of pMaxVax10.1_(null) vector (control). Each mouse received a booster of the same dose of the same plasmid vector as the initial immunization at days 14, 29, and 56 following the initial immunization (day 0). Sera were collected from the mice at days 28, 55, and 76. The collected sera were analyzed for in vivo antibody induction in ELISA assays on DEN-1, DEN-2, DEN-3, and DEN-4 inactivated virus coated ELISA plates in a 1:100 dilution under conditions described in Example 5.

The ELISA analyses of the mouse sera indicated that all 27 plasmids (each comprising one of the PRM15/tE recombinant nucleotide sequences described above) lead to in vivo expression of antigens that induced production of antibodies that reacted with inactivated dengue viruses of all 4 virus serotypes in ELISA assays at both 28 and 55 days after injection. The average optical density (OD) value for each recombinant PRM15/tE antigen encoded by such recombinant plasmid vector was calculated for the sera obtained from each mice receiving such plasmid injection(s) for each serotype-specific ELISA plate tested, as compared to that value obtained using pMaxVax10.1_(null) (data not shown). The 25 recombinant PMR15/tE polypeptide antigens encoded by the recombinant nucleotide sequence identified in the new library included the following: (15C2 (SEQ ID NO:86), 15D4 (SEQ ID NO:87), 15H4 (SEQ ID NO:88), 16B4 (SEQ ID NO:89), 16E8 (SEQ ID NO:90), 16E10 (SEQ ID NO:91), 16F12 (SEQ ID NO:92), 16G11 (SEQ ID NO:93), 17A12 (SEQ ID NO:94), 17D5 (SEQ ID NO:95), 17D11 (SEQ ID NO:96), 17F5 (SEQ ID NO:97), 17F11 (SEQ ID NO:98), 17G5 (SEQ ID NO:99), 17H3 (SEQ ID NO:100), 17H10 (SEQ ID NO:101), 17H12 (SEQ ID NO:102), 18A9 (SEQ ID NO:103), 18B7 (SEQ ID NO:104), 18D7 (SEQ ID NO:105), 18E9 (SEQ ID NO:106), 18E10 (SEQ ID NO:107), 18E11 (SEQ ID NO:108), 18H2 (SEQ ID NO:109), and 18H6 (SEQ ID NO:110).

A further in vivo analysis was performed by injecting groups of 5 mice each under identical conditions with 100 μg (50 μg/leg) of pMaxVax10.1_(null) or with 100 μg pMaxVax10.1 DNA vector comprising one of the following nucleotide sequences: 16B4 (SEQ ID NO:185), 16G11 (SEQ ID NO:187), 18H2 (SEQ ID NO:200), 18H6 (SEQ ID NO:235), Den-1PRM15/tE CO (SEQ ID NO:211), Den-2PRM15/tE CO (SEQ ID NO:212), Den-3PRM15/tE CO (SEQ ID NO:213), Den-4PRM15/tE CO (SEQ ID NO:214). Each mouse of another group of 5 mice was injected with 100 μg (50 μg/leg) of a mixture of pMaxVax10.1_(Den-1PRM15/tE CO), pMaxVax10.1_(Den-2PRM15/tE CO), pMaxVax 10.1_(Den-3PRM15/tE CO), and pMaxVax10.1_(Den-4PRM15/tE CO) at the same times and under the same conditions as described for the other groups of mice that received individual injections of each of the above-described plasmid DNA vectors. Each mouse received a booster immunization of the same dose of the same plasmid vector as the initial immunization at days 14, 29, and 56 following the initial immunization (day 0). Sera were collected from the mice at days 28, 55, 76, 120 and 180. Average OD values from sera at day 55 obtained for the experiments are shown in FIG. 7.

The results of these further ELISA assays indicate that the recombinant PRM15/tE antigens (16B4 (SEQ ID NO:89), 16G11 (SEQ ID NO:93), 18H2 (SEQ ID NO:109), 18H6 (SEQ ID NO:110)) encoded by the selected recombinant nucleotides (16B4 (SEQ ID NO:185), 16G11 (SEQ ID NO:187), 18H2 (SEQ ID NO:200), 18H6 (SEQ ID NO:235)) induced production of antibodies that reacted with dengue viruses of all 4 WT dengue virus serotypes. OD levels exhibited by antisera from mice injected with these selected recombinant clones were significantly higher than those produced by antigens (Den-1PRM15/tE (SEQ ID NO:149), Den-2PRM15/tE (SEQ ID NO:150), Den-3PRM15/tE (SEQ ID NO:151), Den-4PRM15/tE (SEQ ID NO:152)) encoded by any of the parental nucleotide sequences (Den-1PRM15/tE CO (SEQ ID NO:211), Den-2PRM15/tE CO (SEQ ID NO:212), Den-3PRM15/tE CO (SEQ ID NO:213), Den-4PRM15/tE CO (SEQ ID NO:214)), individually or in combination, for the DEN-1, DEN-2, and DEN-3 virus coated ELISA plates. Antisera from mice injected with plasmids comprising these recombinant antigen-encoding sequences also had OD levels at least as high as any of these parental antigen-encoding sequences against DEN-4 (individually or in combination) (high OD levels exhibited by sera obtained from pMaxVax10.1_(Den-3PRM15/tE CO)-injected mice on such plate may suggest a flaw in the ELISA plate used for this portion of the experiment).

The results of these experiments further demonstrate the ability of the recombinant antigens of the invention, and/or recombinant nucleic acids of the invention that encode recombinant antigens, to induce or enhance production of antibodies that react with (or bind or specifically bind to) dengue viruses of all four serotypes in vivo. Such recombinant antigenic polypeptide sequences of the invention, and such recombinant nucleotide sequences encoding recombinant antigenic polypeptides of the invention, are useful, e.g., in prophylactic and/or therapeutic methods of the invention for the induction, modulation, and/or enhancement of the production of antibodies that react with (or bind or specifically bind to) dengue viruses of all four serotypes and/or diagnostic assays to detect the presence of antibodies in a biological sample to 1, 2, 3, and 4 dengue virus serotypes.

Example 9

This example describes the ability of recombinant antigens of the invention to produce, enhance, modulate, and/or promote a neutralizing antibody response(s) against dengue viruses of multiple dengue virus serotypes in vivo.

Mice were individually injected with a pMaxVax10.1 plasmid vector comprising a recombinant DNA sequence corresponding to one of the following—18E9 (SEQ ID NO:170), 18D7 (SEQ ID NO:199), 16G11 (SEQ ID NO:187), 18H6 (SEQ ID NO:235), 15D4 (SEQ ID NO:183), 18H2 (SEQ ID NO:200), 6E12 (SEQ ID NO:159), 2/7 (SEQ ID NO:156), 2G11 (SEQ ID NO:157), and 16B4 (SEQ ID NO:185)—according to the methods set forth in Example 8. These DNA sequences encoded the following recombinant PRM15/tE antigens, respectively: 18E9 (SEQ ID NO:106), 18D7 (SEQ ID NO:105), 16G11 (SEQ ID NO:93), 18H6 (SEQ ID NO:110), 15D4 (SEQ ID NO:87), 18H2 (SEQ ID NO:109), 6E12 (SEQ ID NO:69), 2/7 (SEQ ID NO:65), 2G11 (SEQ ID NO:66) and 16B4 (SEQ ID NO:89). In a similar experiment DEN-1, DEN_(—)2, DEN-3, and DEN-4 wild type sequences, coding for the DEN-1-4 wild-type PRM15/tE antigens, and a equal mix of these 4 wild-type antigens were injected. All mice received 3 booster injections with the same plasmid DNA in 2-week intervals.

Mice were also individually injected with a pMaxVax10.1 plasmid vector comprising a recombinant DNA sequence corresponding to one of the following—5/21-D1 (SEQ ID NO:201), 2G11-D4 (SEQ ID NO:204), and 6E12-D4 (SEQ ID NO:202). These DNA sequences encoded the following recombinant full length C15/full prM/full length E antigens, respectively: following—5/21-D1 (SEQ ID NO:140), 2G11-D4 (SEQ ID NO:139), and 6E12-D4 (SEQ ID NO:141). In a similar experiment DEN-1, DEN-2, DEN-3, and DEN-4 wild type sequences, coding for the wild-type C15/full prM/full length E antigens, and a equal mix of these 4 wild-type antigens were injected. All mice received 3 booster injections with the same plasmid DNA in 2-week intervals.

Antisera obtained from the mice 76 days after initial DNA injection were analyzed by standard plaque reduction neutralization titer (PRNT) assay, which is well known to those of ordinary skill in the art (see, e.g., Russell et al., J Immunol (1967) 99:285-290; Simmons et al, Am. J. Trop. Med. Hyg (2001) 65:420-426), each incorporated herein by reference in its entirety for all purposes.

Briefly, this PRNT₅₀ assay is typically conducted as follows: Cell cultures of monkey kidney cells (LLC-MK2) are seeded in 6 well culture plates and incubated at 37° C. in a CO₂ incubator. Each of the antisera obtained from the mice injected with one of the 9 plasmid vectors is diluted to make 1:20, 1:40, and 1:80 serial dilutions. Each cell culture is incubated with a mixture of (i) dengue viruses of each of the four 4 dengue virus serotypes and (ii) a 1:20, 1:40, or 1:80 serial dilution of antisera for 2 to 3 hours.

After incubation of 2-3 hours, the inoculum mixture of dengue viruses and diluted antisera are removed from the LLC-MK2 monkey kidney cell cultures, and a layer of agarose (SeaPlaque agarose, FMC Bioproducts) is added to the cell cultures. Plaques formed by the released virus progeny are visualized at day 7 by staining with a 0.02% neutral red/Hanks balanced salt solution. The plaque counts for each such cell culture are compared to plaque counts for LLC-MK2 cells incubated with an identical mixture of dengue viruses without any antisera. The specific determination of 50% plaque reduction neutralization titers (PRNT₅₀) for the cultures is facilitated by use of Probit analysis software (SSPS, Inc. Chicago, Ill.), using standard techniques and according to manufacturer's instructions. The 50% effective dose (ED₅₀) is the serum dilution that caused a 50% reduction in the number of plaques. The amount of antiserum (e.g., serum from a subject containing specific antibodies produced by immunization with a specific immunogen) required to neutralize 50% of the infectious virus particles included in a specific virus challenge dose is directly related to the potency of the antiserum. Russell et al., supra, at 286.

FIG. 8A shows the results of PRNT₅₀ analyses for these 10 recombinant PRM15/tE antigen-encoding DNA sequences (e.g., clones 18E9, 18D7, 16G11, 18H6, 15D4, 18H2, 6E12, 2/7, 2G11, and 16B4), the four DEN wild-type PRM15/tE antigens (DEN-1, DEN-2, DEN-3, and DEN-4) individually, and an equal mix of these four DEN wild-types. FIG. 8B shows the results of PRNT₅₀ analyses for these 3 recombinant full length C15/full prM/full length E antigen-encoding DNA sequences (e.g., clones 5/21-D1, 2G11-D4, and 6E12-D4), the four DEN wild-type C1S/full prM/full length E antigens (DEN-1, DEN-2, DEN-3, and DEN-4) individually, and an equal mix of these four DEN wild-types. Reciprocal PRNT₅₀ titers of >20 were regarded as positive for production of neutralizing antibodies against a particular dengue virus serotype used for the in vitro neutralization assay. Applying this standard to the PRNT₅₀ titers presented in FIGS. 8A and 8B, all 13 of these recombinant plasmid vectors induced production of neutralizing antibodies against at least 2 dengue viruses (e.g., against at least two of DEN-1, DEN-2, DEN-3, and/or DEN-4). For example, injection of mice with pMaxVax10.1_(18E9), induced neutralizing antibodies against at least DEN-1 and DEN-2. Injection of mice with pMaxVax10.1_(18D7), pMaxVax10.1_(15D4), and pMaxVax10.1_(6E12) induced neutralizing antibodies against at least DEN-1, DEN-2, and DEN-3. Injection of mice with pMaxVax10.1_(2/7), pMaxVax10.1_(2G11), pMaxVax10.1_(16G11), pMaxVax10.1_(18H6), pMaxVax10.1_(18H2), and pMaxVax10.1_(16B4), pMaxVax10.1_(5/21-D1), pMaxVax10.1_(2G11-D4), and pMaxVax10.1_(6E12-D4) induced production of neutralizing antibodies against all four dengue virus serotypes, DEN-1, DEN-2, DEN-3, and DEN-4. In contrast, only a mix of the wild-type DEN-1, DEN-2, DEN-3, and DEN-4, coding for the recombinant full length C15/full prM/full length E antigens, induced production of neutralizing antibodies against all four dengue virus serotypes, DEN-1, DEN-2, DEN-3, and DEN-4. The two DEN wild-type PRM15/tE antigens DEN-1, and DEN-2, as well as a mix of the four DEN wild-type PRM15/tE antigens (DEN-1, and DEN-2, DEN-3, and DEN-4), and the two DEN-wild-type full length C15/full prM/full length E DEN-1, and DEN-2 antigens, did not induce neutralizing antibodies. The two DEN wild-type PRM15/tE antigens DEN-3, and DEN-4 induced only neutralizing antibodies against the homologous DEN-virus (DEN-1 and DEN-2, respectively), while the two DEN wild-type full length C15/full prM/full length E antigens, DEN-3, and DEN-4, induced neutralizing antibodies against at least 3 Den viruses (DEN-2, DEN-3, and DEN-4).

ELISA analyses were also performed with sera obtained from mice injected with 100 μg pMaxVax10.1 plasmid vector comprising a recombinant nucleotide sequence corresponding of one of each of these 13 clones (clones 18E9, 18D7, 16G11, 18H6, 15D4, 18H2, 6E12, 2/7, 2G11, 16B4, 5/21-D1, 2G11-D4, and 6E12-D4) and sera obtained from mice injected with pMaxVax10.1_(null) using DEN-1, DEN-2, DEN-3, and DEN-4 virus coated ELISA plates and DNA immunization techniques according to the methods described in Example 8. The resulting ELISA data demonstrated that all 13 of these recombinant clones induced production of antibodies in vivo that reacted with all four dengue virus serotypes in vitro (data not shown).

The results of these experiments demonstrate that the recombinant nucleotide sequences encoding recombinant dengue antigens (for the PRM15/tE format and the C15/full prM/full length E format) of the invention and/or recombinant dengue antigens ((for the PRM15/tE format and the C15/full prM/full length E format)) of the invention induced, enhanced, promoted, and/or modulated production of neutralizing antibodies to at least 2, 3, or even 4 dengue virus serotypes when such recombinant antigenic polypeptides are expressed in vivo. Such recombinant antigenic polypeptides of the invention, and such recombinant nucleotide sequences encoding recombinant antigenic polypeptides of the invention, are useful, e.g., in prophylactic/and or therapeutic methods of the invention for the induction, modulation, and/or enhancement of the production of neutralizing antibodies to at least 2, 3, or even 4 dengue virus serotypes when such recombinant antigenic polypeptides are expressed in vivo.

Example 10

This example illustrates the secretion characteristics of recombinant dengue antigens determined to induce, enhance, promote, and/or modulate neutralizing antibody production against dengue viruses of at least 2 serotypes in vivo.

The plasmid DNA corresponding to the pMaxVax10.1 vector comprising the recombinant nucleotide sequence of each of clones 18D7, 18E11, 16G11, 18H6, 18H2, 16B4, 14G10, 18E9, and 18E10 was isolated and purified by standard techniques and used to transfect 293 cells as described above and in Example 1 and 2. The transfected cells were cultured for 72 hours, and 15 μl of the unconcentrated cell-free medium supernatants subjected to polyacrylamide gel electrophoresis and membrane blotting. The nitrocellulose membranes were incubated with the anti-DEN-1, DEN-3, and DEN-4 antibodies from mouse ascitic fluid and appropriate enzyme-conjugated secondary antibodies to produce a Western blot using the techniques described above and in Example 1. FIG. 9 is a visualization of the Western blot obtained by this technique; the number (N) of wild-type dengue virus serotypes neutralized by the respective recombinant clones, as discussed in Example 9, is indicated on the bottom of the Western blot.

As shown in FIG. 9, 5 of the recombinant clones (18E9, 18E10, 18E11, 18H12, and 18H6) expressed and secreted recombinant antigens that produced dark bands on the Western blot, whereas three recombinant antigens (14G10, 16B4, 18D7) were well secreted, but not strongly bound by the antibodies, resulting in weaker bands. One recombinant clone (16G11) was not secreted at a detectable level in the unconcentrated supernatants. A comparison of the number of WT dengue virus serotypes neutralized by each recombinant antigen with its secretion profile indicated that there was no direct relationship between recombinant antigen secretion and the number of WT dengue virus serotypes neutralized by antibodies induced by the respective recombinant antigens. For example, the recombinant antigen that was not detectably secreted (16G11) produced neutralizing antibodies against all four WT dengue virus serotypes. The well-expressed and secreted recombinant antigens that produced dark bands (18E9, 18E10, 18E11, 18H12, and 18H6) produced neutralizing antibodies against two, two, four, four, and four WT dengue virus serotypes, respectively. The recombinant antigens that produced weaker bands (14G10, 16B4, 18D7) produced antibodies that neutralized two, four, and three 2 WT dengue virus serotypes, respectively.

The results of these experiments demonstrate that both secreted and cell membrane bound recombinant antigens of the invention (and the recombinant nucleotide sequences encoding such recombinant antigens) are effective in inducing, enhancing, promoting, and/or modulating production of neutralizing antibodies against one or more WT dengue virus serotypes in vivo.

Example 11

This example illustrates the longevity of the antibody response to recombinant dengue virus antigens induced by in vivo injection of DNA plasmids comprising recombinant nucleotide sequences of the invention that encode recombinant antigens of the invention.

An immune response against an infectious agent (e.g., protective immune response) preferably is one that is long-lasting. To test the longevity of the antibody immune response in a subject (e.g., mammal) receiving one or more injections of DNA plasmids encoding recombinant dengue antigens of the invention that induce neutralizing Ab responses against at least 2 dengue virus serotypes, the following experiments were performed.

Three mice each were injected with a pMaxVax10.1 DNA plasmid vector, wherein each such vector encoded a recombinant antigen corresponding to one of each of the following eight recombinant clones—16B4 (SEQ ID NO:89), 16G11 (SEQ ID NO:93), 18D7 (SEQ ID NO:105), 18E9 (SEQ ID NO:106), 18E10 (SEQ ID NO:107), 18E11 (SEQ ID NO:108), 18H2 (SEQ ID NO:109), or 18H6 (SEQ ID NO:110), at set time intervals. (Alternatively, mice each were injected with a pMaxVax10.1 DNA plasmid vector, wherein each such vector comprising a nucleic acid corresponding to one of each of the following eight recombinant clones 16B4 (SEQ ID NO:185), 16G11 (SEQ ID NO:187), 18D7 (SEQ ID NO:199), 18E9 (SEQ ID NO:170), 18E10 (SEQ ID NO:171), 18E11 (SEQ ID NO:172), 18H2 (SEQ ID NO:200), or 18H6 (SEQ ID NO:235). Antisera were obtained from these mice using techniques described above (e.g., Example 5). Sera collected from the mice at 55 and 120 days, respectively, after initial DNA immunization (day 0) by injection with these recombinant pMaxVax10.1 plasmid vectors were subjected to ELISA analyses, as described above, using, e.g., inactivated DEN-2 virus coated ELISA plates (all of these recombinant clones had previously been shown to induce a tetravalent antibody response in similar DNA plasmid injection experiments in vivo). The average OD values for antisera of mice injected with each type of plasmid were calculated at both test periods. The antibody response induced by each recombinant clone tested remained high at 55 days after the initial injection of the corresponding recombinant pMaxVax10.1 plasmid vector (comprising the recombinant nucleic acid encoding the recombinant antigen). Remarkably, the antibody responses induced by injection of these recombinant pMaxVax10.1 vectors were substantially unchanged at 120 days after the initial pMaxVax10.1 vector injection as compared to 55 days after the initial pMaxVax10.1 vector injection. For example, antibody responses induced by injection of these recombinant vectors were increased or decreased within a range of from about 0.5%, about 1%, about 2%, about 5%, about 7%, about 10%, about 12%, or about 14%, at 120 days after the initial injection as compared to 55 days after the initial injection.

The results of these experiments demonstrate that recombinant antigens of the invention, and recombinant nucleic acids encoding such antigens, are capable of inducing and/or promoting an in vivo antibody response against multiple WT dengue virus serotypes over sustained periods of time, including e.g., over at least about 55 and 120 days.

Such antibody response(s) may be induced and/or promoted, e.g., by: (1) in vivo or ex vivo administration to a subject of a recombinant DNA plasmid vector comprising a nucleotide sequence that encodes a recombinant antigen of the invention (or a recombinant DNA plasmid vector comprising a recombinant nucleotide sequence of the invention) in an amount sufficient or effective to induce or promote such desired antibody response(s); or (2) by in vivo or ex vivo administration to a subject or cells of the subject of a recombinant antigen (or nucleic acid encoding such antigen) of the invention, or chimeric virus or VLP of the invention, in an amount effective to induce or promote such desired antibody response(s). Such antibody responses are also observed in in vitro or ex vivo assays using antisera obtained from such subjects. The desired Ab response may be, e.g., an antibody response that is sufficient for prophylactic and/or therapeutic treatment of a disease or disorder (including as, e.g., a prophylactic agent or vaccine against dengue infection or dengue fever). Administration of such recombinant DNA plasmid or recombinant antigenic polypeptide to a subject may be according to any in vivo or ex vivo method for delivery or administration of a nucleic acid or polypeptide (or pharmaceutical composition thereof) to a subject as described herein and throughout this specification, including, but not limited to, e.g., injection or gene gun delivery, and including dosages and/or compositions described herein, which may be dependent upon the particular application or treatment method of interest.

Example 12

This example illustrates the production of recombinant dengue virus antigens, and recombinant nucleic acids encoding recombinant dengue antigens, that induce the production of neutralizing antibodies against multiple dengue virus serotypes in vivo, wherein each such a recombinant dengue virus antigen comprises an amino acid sequence having a length (in amino acid residues) identical, substantially identical (e.g., having at least about 75%, 80%, 85%, 86%, 87%, 88% or 89%, preferably at least about 90%, 91%, 92%, 93%, or 94%, and more preferably at least about 95% (e.g., about 87-95%), 96% 97%, 98%, 99%, 99.5% identity in length), equivalent, or substantially equivalent to (e.g., at least about 95%, about 96%, about 97%, about 98%, about 99% or more identical in length to) the length of a fusion protein comprising or consisting of the full length amino acid sequence of a prM protein fused to the full length amino acid sequence of an envelope (E) protein of a dengue virus of a particular serotype. In another aspect, each such recombinant dengue virus antigen is encoded by a nucleotide sequence having a length equivalent or substantially equivalent to (e.g., at least about 95%, about 96%, about 97%, about 98%, about 99% or more identical in length to) the length of a nucleotide sequence encoding a fusion protein comprising or consisting of the full length prM protein sequence fused to the full length E protein sequence of a specific dengue virus serotype.

As described above, for selected methods of recursive sequence recombination, the following codon optimized dengue nucleotide sequences were used as parental sequences: Den-1 PRM15/tE CO, Den-2 PRM15/tE CO, Den-3 PRM15/tE CO, or Den-4 PRM15/tE CO. For each dengue virus serotype, each such parental nucleotide sequence comprised a codon optimized nucleotide sequence encoding a wild-type dengue fusion protein comprising: 1) a “PRM15 polypeptide” (e.g., a polypeptide sequence comprising an initial methionine (Met) residue and the last 15 amino acids of the C terminus of the prM protein of a WT dengue virus of a specific serotype, e.g., DEN-1); and 2) a truncated E protein, wherein the truncated E protein comprised from at least about 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, or 95% or more of the contiguous amino acid residues of the full length E protein sequence of the same (WT) dengue virus serotype, as measured or beginning from about the N terminal amino acid residue of the E protein sequence. That is, the truncated E protein comprised a sequence of contiguous amino acid residues of at least about 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, or 95% or more of the contiguous amino acid residues of the full length E protein sequence of the same (WT) dengue virus serotype, as measured from about the N terminal amino acid residue of the E protein sequence.

For each wild-type dengue virus of one of the four serotypes, a codon optimized nucleotide sequence encoding a fusion protein comprising the full length prM protein and full length E protein of that serotype were made. In one aspect, each such codon optimized nucleotide sequence was made via nucleotide extension of the truncated parental Den-1 PRM15/tE CO, Den-2 PRM15/tE CO, Den-3 PRM15/tE CO, or Den-4 PRM15/tE CO nucleotide sequence, respectively. To generate a nucleotide sequence encoding a fusion protein comprising a full length prM protein fused to a full length E protein (or, e.g., to generate a nucleotide sequence comprising a full length prM nucleotide sequence/full length E nucleotide sequence) via nucleotide extension for each of the codon optimized truncated parental nucleotide sequences (e.g., Den-1 PRM15/tE CO, Den-2 PRM15/tE CO, Den-3 PRM15/tE CO, and Den-4 PRM15/tE CO), codon optimized nucleotide sequences encoding the full length prM protein sequence and full length E protein sequence (i.e., the amino acid residues of the C terminus of E protein needed to be added to the corresponding truncated E protein of the particular dengue virus serotype to make a full length E protein) were determined for each dengue virus serotype (Den-1, Den-2, Den-3, and Den-4) in accordance with the methods described in Example 1. Each codon optimized truncated parental nucleotide sequence (e.g., Den-1 PRM15/tE CO, Den-2 PRM15/tE CO, Den-3 PRM15/tE CO, or Den-4 PRM15/tE CO nucleotide sequence) was analyzed for unique restriction site(s) at the 5′ and 3′ ends of the nucleic acid sequence of the E gene.

By performing such an analysis, it was determined that the codon optimized E genes for all 4 dengue virus serotypes included a unique BsrBI restriction site at about position 57 in the E gene for each particular virus. The BsrBI restriction site also was found in the respective recombinant dengue antigen-encoding nucleotide sequences of clones 5/21, 2/7, 2G11, and 6E12, facilitating 5′ extension of these nucleotide sequences with additional nucleic acid residues necessary to make a corresponding full length (complete) prM gene sequence of a particular serotype, as further described below.

For each dengue virus of the 4 dengue virus serotypes, codon optimized nucleotide sequences also were generated for the full length prM gene and the nucleotide sequence encoding a 16-amino acid sequence comprising Met as the first amino acid residue and the (last) 15 amino acid residues from the C terminus of the capsid (C) gene of the particular dengue virus serotype. Such 16-amino acid sequence served as a signal sequence. Codon optimized prM genes of the dengue viruses of the 4 serotypes, including the nucleotide sequence encoding such 16 amino acid signal sequence, and the 5′ sequences of the E gene overlapping the BsrB1 site, were synthesized by oligonucleotide assembly, as described in Example 1.

Sequence analysis also was able to identify unique restriction sites positioned near the 3′ end of the various dengue E genes. Specifically, a unique BsaBI site was identified at position 1793 in the codon optimized DEN-1 prM/E gene sequence; a unique SexAI site was identified at position 1793 in the codon optimized DEN-2 prM/E gene sequence; a unique BglII site was identified at position 1863 in the codon optimized DEN-3 prM/E gene sequence; and a unique BsrGI site was identified at position 1884 in the codon optimized DEN-4 prM/E gene sequence.

For example, to extend the nucleic acid sequence of Den-4 PRM15/tE CO to a length equivalent or substantially equivalent to the length of a full length C15/full prM/full length E protein fusion protein, a human codon optimized nucleotide sequence corresponding to the 3′ end of the DEN-4 E gene encoding the “C terminal extension” (e.g., a CO sequence comprising the nucleotide residues needed to be added to a truncated E gene to extend the truncated E gene to make a full length E gene) was also synthesized by standard nucleotide synthesis techniques. The synthesized codon optimized Den-4 C15/prM fragment (comprising a sequence of 16 amino acids—15 amino acids of the C terminal of the C protein and the first Met—and the additional N terminal nucleotide residues needed to extend the codon PRM15 nucleotide sequence of a particular serotype to make a full length CO C15/full prM nucleotide sequence) was digested with BsrBI, the synthesized 3′ end of DEN-4 E gene (encoding such codon optimized C terminal extension nucleotide sequence) was digested with BsrG1, and the Den-4 PRM15/tE CO nucleotide sequence was digested with BsrBI and BsrG1 according to manufacturer's instructions. The three resulting fragments were ligated by standard techniques to extend the PRM15 and truncated E gene sequences to form a codon optimized Den-4 full length C15/full prM/full length E nucleotide sequence, designated “Den-4 C15/prM/E.” The same methods were used to synthesize codon optimized C15 signal sequence/full length prM/full length E nucleotide sequences corresponding to each of Den-1, Den-2, and Den-3, respectively, using the specific unique restriction sites discussed above. The term “C15 signal sequence” typically refers to a 16 amino acid sequence that includes as the first codon a codon encoding a Met residue and subsequent codons that encode the 15 amino acids at the C terminus of the capsid protein of a dengue virus of one of the four serotypes. The resulting “full-length” codon optimized C15/full prM/full E nucleotide sequences code for each of the parental Den-1 C15/full prM/full E, Den-2 C15/full prM/full E, and Den-3 C15/full prM/full E polypeptides and were termed Den-1 C15/full prM/full E, Den-2 C15/full prM/full E, Den-3 C15/full prM/full E nucleotides, respectively. In this instance, the term “C15 signal sequence” refers to a 16 amino acid sequence that includes as the first codon a Met residue and subsequent codons that encode the 15 amino acids of the C terminus of the capsid protein, “prM” or “full prM” refers to a full length prM polypeptide or nucleic acid sequence, and the term “E” or “full E” refers to a full length E protein or nucleic acid sequence, depending upon context.

Using a parental Den-1 C15/full prM/full E codon optimized nucleotide sequence, digestions were performed with appropriate restriction enzymes (essentially as described above for the parental genes) to extend the nucleotide sequence of clone 2/7 (PRM15/tE format; SEQ ID NO:156) with a codon optimized nucleotide sequence encoding a 16 amino acid sequence comprising a Met residue as the first amino acid linked to the last 15 contiguous amino acids of the C terminal of the DEN-1 C protein, a portion of the N-terminal portion of DEN-1 prM protein, and codon optimized nucleotide sequence fragments encoding a portion of the C-terminus of DEN-1 E protein, sufficient to generate, upon appropriate ligation the 2/7 nucleotide sequence, a recombinant nucleotide sequence comprising a C15/full length prM/full length E format nucleic acid sequence, termed extended “2/7-D1” (SEQ ID NO:203). Recombinant “extended” codon optimized nucleotide acid 2/7 (SEQ ID NO:203) encoded recombinant 2/7-D1 (C15/full prM/full E format) polypeptide (SEQ ID NO:147).

The nucleic acid sequence of clone 5/21 (PRM15/tE format; SEQ ID NO:158) was similarly extended using nucleotide sequence fragments of the DEN-1 C protein, DEN-1 prM protein, and DEN-1 E protein to generate a recombinant extended codon optimized nucleotide sequence 5/21-D1 (extending clone 5/21 in PRM15/tE polypeptide-encoding polynucleotide format to 5/21-D1 in C15/full prM/full E polypeptide-encoding polynucleotide format using wild-type Den-1 C terminal E protein fragment-encoding nucleic acid and wild-type Den-1 N terminal C15/truncated prM polypeptide-encoding nucleic acid for extension) (SEQ ID NO:201), which encoded recombinant 5/21-D1 (C15/full prM/full E format) polypeptide (SEQ ID NO:201).

Similarly, the parental DEN-4 C15/full prM/full E codon optimized nucleotide sequence was used to extend the nucleotide sequences of 2G11 (SEQ ID NO:157) and 6E12 (SEQ ID NO:159) to create “extended” 2G11-D4 and 6E12-D4 nucleotide sequences in C15/full length prM/full length E nucleotide sequence format (SEQ ID NOS:204 and 202, respectively), which encoded recombinant 2G11-D4 and 6E12-D4 (C15/prM/E format) polypeptides (SEQ ID NOS:139 and 141, respectively). All such resulting recombinant nucleic acids in C15 nucleic acid/full length prM nucleic acid/full length E nucleic acid sequence format were amplified and cloned into pMaxVax10.1 vectors, as described above. The C15 nucleic acid/full length prM nucleic acid/full length E nucleic acid sequence format is conveniently referred to as the “C15/prM/E” or “C15/full prM/full E” format.

Mice were injected with pMaxVax10.1_(6E12-D4) and pMaxVax10.1_(5/21-D1) plasmid vectors and the antiserum therefrom collected at 76 days after initial DNA plasmid vector injection, using the techniques described in Example 9. Non-extended clones pMaxVax10.1_(6E12) and pMaxVax10.1_(2G11) also were injected into mice under similar conditions. Antisera from these mice also were collected at 76 days after initial DNA plasmid vector injection. The antisera obtained from these mice were used in a plaque reduction neutralization titer assay as described in Example 9, and the inverse PRNT₅₀ titers from these assays were calculated and compared to the results of the experiments described in Example 9. The combined reciprocal PRNT₅₀ titers observed in the two sets of experiments are set forth in FIGS. 8A and 8B.

As shown in FIGS. 8A and 8B, both of the extended nucleotide sequences corresponding to “extended” clones 5/21-D1 and 6E12-D4 (C15/full prM/full E format) produced according to this technique encoded recombinant antigens that induced neutralizing antibodies against all 4 dengue serotypes in vivo, based on the PRNT assay standards (>20; dilution of serum (reciprocal)) provided in Example 9. In contrast to the nucleotide sequence of 6E12-D4 (C15 nucleic acid/full length prM nucleic acid/full length E nucleic acid format), which produced inverse (reciprocal) PRNT₅₀ titers of above 80 for each serotype, the nucleotide sequence of clone 6E12 (PRM15/truncated E gene format) produced reciprocal PRNT₅₀ titers of 35, >80, 70, and 20 for Den-1, Den-2, Den-3, and Den-4, respectively (i.e., except for Den-2, lower inverse PRNT₅₀ scores were observed for Den-1, Den-3, and Den-4 for clone 6E12 as compared to clone 6E12-D4). The nucleotide sequence corresponding to clone 2G11 also induced neutralizing antibodies against all 4 dengue virus serotypes.

These experiments demonstrate a method for producing recombinant dengue antigen-encoding nucleotide sequences, each of which encodes a fusion protein that is identical or substantially identical in length (e.g., at least about 75%, 80%, 85%, 86%, 87%, 88% or 89%, preferably at least about 90%, 91%, 92%, 93%, or 94%, and more preferably at least about 95% (e.g., about 87-95%), 96% 97%, 98%, 99%, 99.5% identity in length) to a wild-type dengue virus “C15/full length prM protein/full length E protein” fusion protein. Such recombinant nucleotide sequences encode recombinant “C15/full length prM protein/full length E” fusion proteins. Plaque reduction neutralization assay analysis of such recombinant fusion protein demonstrated that extension of the polypeptide sequence of a recombinant PRM15/trunE dengue antigen with appropriate wild-type sequence fragments to produce an “extended” C15/full prM/full E dengue antigen may increase the number of dengue virus serotypes the antibodies induced by such extended recombinant antigens neutralize in vivo and the reciprocal PRNT titer of such neutralizing antibody responses relative to recombinant PRM15/truncE dengue antigens.

It was previously shown for tick-borne encephalitis (TBE), another flavivirus, that expression of the viral prM gene and 100% of the E gene of TBE can lead to viral polypeptide(s) forming a viral-like (empty) particle (VLP), which can have physical and antigenic characteristics that are substantially similar or identical to those of a whole virus. A VLP is not an infectious particle because it does typically not contain viral genetic material capable of producing live viruses.

The present invention also includes dengue viral-like particles (VLPs), each of which comprises a dengue polypeptide sequence of the invention, including, e.g., but not limited to: (1) a fusion protein comprising a recombinant C15 signal peptide/full length prM protein/full length E protein; (2) a fusion protein comprising a recombinant full length prM protein/full length E protein (with or without any signal sequence, including any flavivirus signal peptide, described herein); (3) a fusion protein comprising a recombinant full length M protein/full length E protein (with or without any pr segment or fragment of a pr segment, described herein) and (4) a recombinant full length E protein (with or without any signal sequence, including any prM or fragment thereof, described herein);

In the present case, recombinant VLPs of the invention can be made, e.g., as follows. Cells (e.g., 293 cells) are transfected with a plasmid vector comprising a recombinant nucleotide sequence encoding a recombinant protein or fusion protein of the invention (e.g., including any of those described in (1) to (4) above. The transfected DNA sequence is translated into the corresponding recombinant protein or fusion protein, respectively; in some instances, where a recombinant fusion protein is produced, such fusion protein which may be subsequently cleaved by a protease in the cell into its components, yielding, e.g., a full length prM protein and full length E protein; C15 signal peptide; M protein and E protein; and pr segment (typically degraded). One such expressed protein/peptide associates or assembles with at least one other such protein/peptide, forming oligomers and such oligomers assemble to form recombinant VLPs in the cells. The mature particles are released from the cells into the medium by exocytosis. In some embodiments, the resulting VLPs may further comprise, associate with, or assemble with cellular membrane material. In some instances, the signal peptide sequence is not included in the resulting recombinant VLP. Following expression and formation, the VLPs of the invention can be isolated by, e.g., gradient centrifugation or other methods known in the art. Such recombinant VLPs of the invention are useful in methods for the prophylactic and/or therapeutic treatment of diseases or disorders described herein or in diagnostic assays described herein for simultaneous detection or diagnosis of antibodies against two or more (e.g., two, three, four) serotypes of dengue virus in a sample, such as a biological sample from a subject, such as a human patient at risk for dengue virus infection.

Example 13

This example further demonstrates the improved expression and/or secretion of recombinant dengue virus antigens of the invention.

293 cell cultures were prepared and transfected with pMaxVax10.1_(2G11), pMaxVax10.1_(18H6), pMaxVax10.1_(Den-3PRM15/tE CO), pMaxVax10.1_(Den-4PRM15/tE CO), and pMaxVax10.1_(null) plasmid vectors. The transfected cells were incubated for 72 hours under conditions permissive for transgene expression (or heterologous nucleotide expression) and secretion. A control vector that did not include a dengue virus nucleotide sequence (pMaxVax10.1_(null) vector) was included for comparison. The cell-free medium from each such culture was obtained, subjected to standard polyacrylamide electrophoresis and membrane blotting, and the membrane incubated with a mix of anti-DEN-1, DEN-3, and DEN-4 antibodies from mouse ascitic fluid and appropriate enzyme-conjugated secondary antibodies in accordance with the techniques described in Example 1 to produce the Western blot. The lanes corresponding to recombinant 2G11 and 18H6 dengue antigens, expressed from pMaxVax10.1_(2G11) and pMaxVax10.1_(18H6), respectively, contained significantly more dengue E protein, as reflected in darker, broader bands, than was observed in the lanes corresponding to wild-type Den-3PRM15/tE and Den-4PRM15/tE dengue antigens, expressed from pMaxVax10.1_(Den-3PRM15/tE CO) and pMaxVax10.1_(Den-4PRM15/tE CO), respectively (data not shown). The results reflect higher levels of expression and/or secretion of recombinant 2G11 and 18H6 dengue antigens, expressed from pMaxVax10.1_(2G11) and pMaxVax10.1_(18H6), respectively. These results evidence the improved secretion and/or expression of recombinant dengue virus antigens of the invention.

Example 14

This example demonstrates the production of recombinant C15/full length prM/full length E CO dengue nucleotide sequences and recombinant fusion proteins encoded therefrom using recursive sequence recombination methods. Libraries comprising such recombinant nucleotide sequences were also generated.

Den-1PRM15/tE CO, Den-2 PRM15/tE CO, Den-3 PRM15/tE CO, and Den-4 PRM15/tE CO nucleotide sequences were extended using the techniques of Example 12 to generate Den-1 C15/full length prM/full E CO, Den-2 C15/prM/full length E CO, Den-3 C15/full length prM/full length E CO, and Den-4 C15/full length prM/full length E CO dengue nucleotide sequences, respectively. The plasmid DNA for each of these 4 extended antigen-encoding sequences was isolated from E. coli amplification, purified, and used as starting sequence material in recursive sequence recombination as described in Example 1. Recombinant nucleic acids were isolated by appropriate rescue primers and ligated into pMaxVax10.1 vectors. Library transfections of E. coli were performed as described in Example 2.

Example 15

This example demonstrates the ability of a recombinant dengue virus antigen of the invention comprising a recombinant fusion protein comprising a recombinant C15 signal peptide/full length prM protein/full length E protein, recombinant full length prM protein/full length E protein, or recombinant full length M/full length E protein to induce, enhance, or modulate production of antibodies that react with (or bind or specifically bind to) dengue viruses of multiple serotypes in vivo and/or ex vivo.

Libraries of recombinant nucleic acid sequences were generated by recursive sequence recombination using as parental sequences four human codon-optimized dengue virus nucleotide sequences encoding WT Den-1, Den-2, Den-3, and Den-4 polypeptides, respectively. Each such parental nucleotide sequence comprised the following: a nucleic acid encoding a methionine, a nucleotide sequence encoding the last 15 amino acid residues of the C terminal of the capsid (C) protein of the respective WT dengue virus (which served as a signal sequence), a nucleotide sequence encoding a full length prM sequence of the respective WT dengue virus, and a nucleotide sequence encoding the full length E protein of said respective WT dengue virus. Each such parent encoded a recombinant fusion protein comprising a Met residue at the N terminal, a recombinant amino acid sequence of 15 amino acid residues that served as a signal sequence, a recombinant full length prM protein, and a recombinant E protein.

Dot blot analyses of cell culture medium supernatants, wherein each supernatant obtained from 293 cells transfected with a particular pMaxVax10.1 vector comprising a specific shuffled nucleotide sequence obtained via shuffling of the C15/full prM/full envelope dengue virus parental human codon optimized nucleotide sequences, were performed. Analyses indicated the following recombinant C15/full prM/full E dengue antigen clones were expressed, secreted, and recognized by a mix of DEN-1, DEN-2, DEN-3, and DEN-4 mouse antibodies: 21C1 (SEQ ID NO:142), 23C12 (SEQ ID NO:143), 23D5 (SEQ ID NO:144), 23F5 (SEQ ID NO:145), 23G3 (SEQ ID NO:146), 23H7 (SEQ ID NO:148), 25B6 (SEQ ID NO:236), 25B10 (SEQ ID NO:237), 25D4 (SEQ ID NO:238), 25E11 (SEQ ID NO:239), 25H4 (SEQ ID NO:240), 27A11 (SEQ ID NO:241), 27G6 (SEQ ID NO:242), 28A11 (SEQ ID NO:243), 28C1 (SEQ ID NO:244), 28D11 (SEQ ID NO:245), 28E12 (SEQ ID NO:246), 28F9 (SEQ ID NO:247), 28H3 (SEQ ID NO:248), 28H9 (SEQ ID NO:249). The pMaxVax10.1 DNA plasmid vectors comprising the shuffled C15/full length prM/full length E protein nucleotide sequences for clones 21C1, 23C12, 23D5, 23F5, 23G3, and 23H7 were isolated, purified, and injected into mice, along with pMaxVax10.1_(null), following the DNA injection regimen described in Example 5. The experiments were performed in triplicate (i.e., three mice received repeated injections of each of the indicated plasmids). Sera from the mice were obtained at appropriate times (e.g., 28, 55, and 76 days after initial DNA injection (day 0)), and analyzed by ELISA, as described above, using DEN-1, DEN-2, DEN-3, and DEN-4 virus coated ELISA plates. Average OD values observed in the ELISA assays were calculated for each DNA vector tested. The results of these calculations are set forth for day 76 (“d76”) antisera in FIG. 10.

FIG. 10 shows that all of these recombinant full length prM/full length E dengue antigens (Ag) (expressed with C15 signal peptide) induced production of populations of antibodies in vivo that reacted with all 4 DEN serotypes in vitro. OD values obtained in ELISA assays with pMaxVax10.1_(23G3) and pMaxVax10.1_(23H7) were the highest and similarly strong in all 4 assays with the 4 dengue virus serotypes. The signal peptide of the C15/full prM/full E dengue antigens is typically cleaved and thus similar or equivalent immune response and immune-stimulating results are believed produced with full prM/full E dengue antigen polypeptides.

The results of this experiment demonstrate that recombinant antigens of the invention comprising recombinant C15/full length prM/full length E protein sequences are capable of inducing, enhancing, and/or modulating the production of antibodies that react with (or bind or specifically bind to) one, two three, or 4 dengue virus serotypes in in vivo, ex vivo, and/or in vitro methods of the invention.

Example 16

This example demonstrates the secretion and/or expression characteristics of recombinant dengue antigens of the invention which comprise recombinant fusion proteins comprising, e.g., a recombinant C15 signal peptide/full length prM/full length E protein sequence, as determined by Western blot analysis. The invention also includes recombinant full length prM/full length E protein sequences and recombinant full length M/full length E protein sequences, which are produced by enzymatic cleavage of the C15 signal peptide from a C15 signal peptide/full length prM/full length E protein sequence and enzymatic cleavage of a “pr” segment from a full length prM/full length E protein sequence, respectively.

Individual 293 cell cultures were each transfected with a pMaxVax10.1 vector comprising a nucleotide sequence corresponding to one of the following eight representative recombinant clones: 23H7, 23G3, 23F5, 23D5, 23C12, 23A11, 21C1, and 21B4. The nucleic acid sequence of each such clone was generated and identified by the recursive sequence recombination (see, e.g., Example 15) and selected screening assays. Clones 23A11 and 21B4 were negative in an ELISA analysis performed by the techniques described in Examples 1 and 15. ELISA analysis indicated that the remaining six clones secreted and/or expressed recombinant antigens that induced production of antibodies that reacted with all four dengue virus serotypes in vivo (FIG. 10). After an appropriate incubation period under transgene expression-permissive conditions, the unconcentrated supernatant (cell-free portion of medium) from the transfected 293 cell cultures was collected, separated by polyacrylamide electrophoresis, and blotted onto an appropriate membrane for Western blot analysis as described in Example 1, using anti-DEN-1, DEN-3, and DEN-4 antibodies from mouse ascitic fluid. The nitrocellulose membrane bound proteins were incubated with antibodies from mouse ascitic fluid for DEN-1, DEN-3, and DEN-4. The recombinant antigens (produced by DNA shuffling and appropriate screening/selection procedures using C15/full length prM/full length E protein-encoding nucleotide sequence formats) that were positive for production of a multivalent antibody response in vivo (e.g., 23H7, 23G3, 23F5, 23D5, 23C12, 21C1) were associated with at least one visible band in the Western blot (data not shown), whereas the clones that did not induce such an antibody response (e.g., 23A11 and 21B4) were not associated with any significant bands in the Western blot, suggesting poor expression and/or secretion of proteins encoded by 23A11 and 21B4 clones as compared with the recombinant antigens 23H7, 23G3, 23F5, 23D5, 23C12, 21C1.

Additionally, bands corresponding to multimers (e.g., full length E protein dimers, trimers, and/or other multimers and/or misfolded protein complexes) were observed in the Western blot (data not shown). Since wild-type E protein is believed to form homodimeric rods on the surface of the virion (FIELDS VIROLOGY, supra), the appearance of bands corresponding to the size of E protein dimers by Western Blot indicates that the majority of the recombinant envelope proteins are correctly folding. Multimers of other polypeptides of the invention include, but are not limited to, e.g., C15/full length prM/full length E protein multimers; full length prM/full length E protein multimers; full length M/full length E protein multimers; PRM15/full length E protein multimers; PRM15/truncated E protein multimers).

The results of these experiments demonstrate that select nucleic acid sequences of the invention can be used to effectively express recombinant dengue antigens as secreted proteins.

Example 17

This example demonstrates the sequence diversity observed in various nucleic acids of the invention as compared to corresponding wild-type sequences.

The amino acid sequences of 16B4, 16G11, 18H2, 18H6, 18E11, 18E10, 18E9, 14G10, and 18D7, each of which comprised a PRM15/truncated E format, were determined and compared to the amino acid sequences of each of the WT parental polypeptides, DEN-1 PRM15/truncated E, DEN-2 PRM15/truncated E, DEN-3 PRM15/truncated E, and DEN-4 PRM15/truncated E. The results of these analyses are provided in FIG. 11. As shown in this figure, these recombinant PRM15/truncated E protein antigens of the invention exhibited complex sequence diversity, comprising multiple amino acid fragments or segments, each such fragment or segment comprising one or more amino acid residues of the four parental WT PRM15/tE protein dengue virus antigen sequences.

Recombinant polypeptide antigens corresponding to clones 23G3, 23H7, 23F5, 23C12, 23D5, 21C1, 21B4, 23A11, 5/21-D1, and 6E12-D4, each of which comprised a C15/full length prM/full length E protein format, were similarly sequenced and compared with respect to amino acid sequences of each of the DEN-1, DEN-2, DEN-3, and DEN-4 C15/full length prM/full length E fusion proteins. These antigens were similarly found to have complex sequence diversity, comprising multiple amino acid fragments or segments, each such fragment or segment comprising one or more amino acid residues of the four parental WT C15/full prM/full E protein dengue virus antigen sequences (data not shown).

Example 18

This example demonstrates, among other things, the ability of recombinant dengue virus antigens of the invention to induce or promote a protective immune response(s) in vivo in a subject, and administered ex vivo in tissue or cells of a subject, and/or in vitro in a population of cells. In this example, such immune response is induced or promoted upon expression of a recombinant dengue virus antigen of the invention from a plasmid vector comprising a nucleotide sequence encoding such antigen; the amount of plasmid vector administered is that sufficient to produce an immunogenic amount of the recombinant antigen. A protective immune response(s) can be similarly induced or promoted by in vivo, ex vivo, and/or in vitro administration to a subject of a an immunogenic amount of a polypeptide comprising such recombinant dengue virus antigen.

Dengue viruses do not typically induce clinical syndromes in adult mice. However, dengue viruses injected into the cerebellum of young mice (e.g., about 3-6 weeks old) cause paralyses, encephalitis and death. Such a mouse model can thus be used to evaluate the in vivo protective efficacy of sera produced in mice immunized with recombinant polypeptides of the invention or nucleic acids encoding such recombinant polypeptides.

Mice were immunized with 100 μg plasmid DNA of one of the following vectors: pMaxVax 10.118H6, pMaxVax10.1_(2G11-D4), pMaxVax10.1_(6E12-D4), a mix of pMaxVax10.1_(18H6)/pMaxVax10.1_(2G11-D4)/pMaxVax10.1_(6E12-D4), pMaxVax10.1_(Den-2PRM15/E CO), pMaxVax10.1_(Den-2 C15/fullprM/full E), pMaxVax10.1_(Den-3C15/fullprM/full E P), a mix of four WT DEN-1-4_(PRM15/tE CO), a mix of four WT DEN-1-4_(C15/fullprM/full E), and, pMaxVax10.1_(null) (termed “vector control” in FIG. 12); one group of mice was immunized with PBS buffer only. The nucleotide sequences corresponding to clones 18H6, 2G11-D4, and 6E12-D4 were SEQ ID NOS:235, 204, and 202, respectively; such sequences were generated using PRM15/tE (for 18H6) and C15/fullprM/full E (for 2G11-D4 and 6E12-D4) parental nucleotide sequences. Injections were repeated at day 10 after initial DNA injection. At 21 days after initial DNA injection, the mice were challenged with intracerebral injections of 100 LD₅₀ of DEN-2 virus particles. The injection and infection experiments were repeated in 5 different mice for each vector or control tested.

The mice were observed for 28 days after challenge for signs of paralyses, encephalitis and/or death. A graph of the survival rates in the first 28 days after challenge is provided in FIG. 12. The results reported in FIG. 12 demonstrate that the injection of a plasmid encoding either the 18H6 and WT DEN-2 polypeptide antigen (each in PRM15/tE format), or 6E12-D4 and WT DEN-2 (each in C15/full prM/full E format), as well a composition or mixture of the three recombinant antigens (18H6, 2G11-D4, and 6E12-4) in PBS was able to induce, enhance, or promote a protective immune response(s) against challenge with DEN-2 virus in vivo. At 28 days after DEN-2 virus challenge, all of the mice immunized with the above-listed recombinant clones were normal and thus protected. Mice immunized with either of the WT DEN-2 polypeptides (in either PRM/tE or C15/full prM/full E format) were also protected as observed 28 days after DEN-2 virus challenge. In contrast, of the mice injected with a tetravalent mix of the four WT DEN-1-4 antigens in the PRM/tE or C15/full prM/full E format only 50% were protected as observed 28 days after DEN-2 virus challenge. Of the mice injected with DEN-3 C15/full prM/full E antigen, only 50% were protected as observed 28 days after DEN-2 virus challenge. All of the PBS-immunized mice, and 75% of the mice that received injections of the vector control (pMV10.1 were dead 12 days after challenge with DEN-2 virus. The results of these experiments demonstrate that such recombinant polypeptide antigens of the invention (and nucleic acids encoding such recombinant antigens) are capable of inducing, promoting, and/or enhancing a protective immune response(s) against at least wild-type Den-2 virus in vivo. Analogous experiments can be performed using other animal models, if desired.

Analogous experiments are performed using mice (or other animal models) immunized with these recombinant antigens and challenged with Den-1, Den-3, and/or Den-4 viruses to assess the immune response(s) induced, promoted, or modulated by such recombinant antigens against one or more of these WT dengue viruses.

Similar experiments are performed using mice or other animal models immunized with other recombinant antigens of the invention, and/or nucleic acids encoding such antigens, and challenged with at least one of the four WT dengue virus serotypes.

Example 19

This example illustrates the preparation and use of recombinant antigens of the invention, including, e.g., tetravalent antigens, as diagnostic antigens or screening antigens (e.g., diagnostic tools) in methods to detect, diagnose, screen for, and/or identify the presence of antibodies against at least one, preferably two or more, and even more preferably, all 4 WT dengue virus serotypes in a sample, such as a biological sample from a mammal. Such recombinant antigens can be used for serum analyses either coated to a microtiter plate or spotted on a suitable membrane (nitrocellulose, nylon) for dot blot analysis or to a dip-stick membrane. Such microtiter plates or membranes, or dip-sticks can be coated with the recombinant antigens directly from the supernatant of transfected cells.

Four of the tetravalent antigens of the invention, corresponding to clones 2/7, 5/21, 2G11, and 6E12 as described in Example 5, each having a recombinant PRM15/tE protein format, were selected to test as diagnostic antigens. Recombinant C15/full prM/full E polypeptide antigens, which are believed to form VLPs, can also be used (e.g., 2/7-D1, 5/21-D1, 2G11-D4, and 6E12-D4). The feasibility of use of a recombinant antigen as a diagnostic antigen or diagnostic or screening tool in diagnostic or screening methods of the invention for the detection, diagnosis, screening or identification of the presence of antibodies against one or more of the 4 WT dengue virus serotypes in a sample was demonstrated by a dot blot assay. Specifically, human 293 cells were transfected with pMaxVax10.1 plasmids coding for each of the four selected tetravalent antigens. Three days after transfection, the media supernatants were harvested and spotted in serial dilution of 10-100 μl on nitrocellulose membranes. To evaluate the sensitivity of the assay, sera from mice (or sera from other mammals, such as humans) infected with dengue viruses were used in a 1:2000 dilution in PBS to detect or screen for type-specific antibodies. For detection or screening, the membranes were first incubated with the test sera and subsequently with secondary antibodies linked to a detection reagent (HRP (horseradish peroxidase), AP (alkaline phosphatase), or FITC (Fluorescein isothiocyanate)). The dot blot of the spotted media supernatants demonstrates that each of these recombinant polypeptide variants was well secreted (FIG. 13A) and recognized by all 4 type-specific anti-DEN antisera (FIG. 13B); thus, these clones can be used to detect or screen for serum antibodies against any of the 4 WT DEN serotypes. Moreover, as illustrated in FIGS. 13A and 13B, as little as about 10 μl of the crude supernatants from cell culture medium per dot in the assay was sufficient for detection of type-specific DEN antibodies in the sera, which makes manufacturing diagnostic dot blots using the recombinant dengue antigens (including, e.g., recombinant tetravalent dengue antigens) easy and inexpensive.

The results of these experiments demonstrate that even a very small amount of a recombinant dengue virus antigen of the invention (or composition thereof) provides a convenient substrate for analysis of biological samples obtained from subjects, e.g., mammals, including humans, for the presence or absence of anti-dengue virus antibodies against one or more virus serotypes. The present invention includes methods for detecting and/or screening for the presence of one or more type-specific antibodies in sera of dengue virus infected mammals, including humans. Such methods include those described above. In addition, the recombinant antigens of the invention (and nucleic acids encoding them) can be used in other diagnostic assays, such as, e.g., plate ELISA and dip stick assays, for detection of one or more type-specific antibodies in sera of dengue virus infected mammals.

The present invention also includes diagnostic dot blots comprising one or more of the recombinant dengue antigens of the inventions, and preferably including one or more of those recombinant dengue antigens of the invention that react (or bind or specifically bind to) antibodies of two or more, and preferably all four dengue virus serotypes. The invention further provides kits comprising one or more such diagnostic dot blots and/or one or more such diagnostic recombinant antigens.

Example 20

This example illustrates the dot blot analysis of a nucleotide library comprising recombinant C15/full length prM/full length E protein nucleotide sequences and selection from such libraries of those recombinant nucleotides expressing recombinant antigens (in recombinant C15/full length prM/full length E protein format or full length prM/full length E protein format) that exhibit improved secretion and/or expression in mammalian cells as compared to the secretion and/or expression WT prM/E antigens, respectively, in such mammalian cells.

Library transfections of E. coli and subsequent transfection of 293 cells in a 96 well cell culture format using plasmids comprising isolated clone DNA sequences encoding antigens in C15/full length prM/full length E protein format (or full length prM/full length E protein format) were carried out in accordance with the techniques described in Examples 2 and 14. Plasmids comprising a nucleotide sequence of the same format—C15/full length prM/full length E fusion protein-encoding nucleotide sequence (or a full length prM/full length E protein-encoding nucleotide sequence)—of one of each of the 4 WT dengue virus serotypes were used as controls in 293 cell transfections. The cell-free aspirated supernatant obtained from such 293 cell cultures after an appropriate period for recombinant antigen expression are then bound to membranes by application of standard dot blot techniques in accordance with applicable manufacturer instructions. The dot blot membranes are incubated with anti-DEN-1, DEN-2, DEN-3, and DEN-4 antibodies from mouse ascitic fluid and appropriate enzyme-conjugated secondary antibodies in accordance with the methods described in Example 1 for Western blots. Clones that indicate higher levels of secretion and/or expression than the control WT dengue C15/full prM/full E-encoding nucleic acids (WT dengue full prM/full E-encoding nucleic acids), as shown by the presence of a more intense signal on the dot blot, are then selected for further cloning and analysis.

This experiment demonstrates an exemplary strategy for analyzing a library of recombinant nucleic acids encoding recombinant dengue antigens, which have an amino acid sequence similar in size to a full length prM/full E fusion protein, for those antigens exhibiting improved or enhanced levels of secretion and/or expression in mammalian cells than corresponding WT dengue antigens of similar size.

Example 21

This example illustrates the production of recombinant dengue antigens each having an amino acid sequence identical in length to or substantially similar or substantially identical in length (e.g., at least about 75%, 80%, 85%, 86%, 87%, 88% or 89%, preferably at least about 90%, 91%, 92%, 93%, or 94%, and more preferably at least about 95% (e.g., about 87-95%), 96% 97%, 98%, 99%, 99.5% identity in length) to a fusion protein comprising a C15 signal sequence, full length prM protein, and full length E protein.

The nucleotide sequences of clone 25B10 (SEQ ID NO:259) (in C15/full prM/full E protein format) and clone 16G11 (PRM15/tE format) (SEQ ID NO:187) were digested with appropriate restriction enzymes, and the nucleotide sequence of clone 16G11 was extended at its C terminus (C terminus of its truncated E protein) with a sufficient and appropriate codon optimized nucleotide segment of the C terminus of the E protein of 25B10 (see SEQ ID NO:259) and was extended at its N terminus (N terminus of its PRM15 signal peptide) with a sufficient and appropriate codon optimized nucleotide segment of the N terminus of the C15/full prM of 25B10 (see SEQ ID NO:259) to make a recombinant clone having a C15/full length prM/full length E format, as described in Example 12.

Similarly, the nucleotide sequences of recombinant clone 25B10 (SEQ ID NO:259) (in C15/full prM/full E protein format) and recombinant clone 18H6 (PRM15/tE format) (SEQ ID NO:235) were digested with appropriate restriction enzymes, and the nucleotide sequence of clone 18H6 was extended at the C terminus of its truncated E protein with a sufficient and appropriate codon optimized nucleotide segment of the C terminus of the E protein of 25B10 (see SEQ ID NO:259) and was extended at the N terminus of its PRM15 with a sufficient and appropriate codon optimized nucleotide segment of the N terminus of the C15/full prM of 25B10 (observed in SEQ ID NO:259) to make a recombinant clone having a C15/full length prM/full length E format, as described in Example 12. Similarly, the nucleotide sequences of wild-type DEN-4 C15/full length prM/full length E protein (SEQ ID NO:234) and recombinant clones 16G11 (PRM15/tE format) (SEQ ID NO:187) and 18H6 (PRM15/tE format) (SEQ ID NO:235) were digested with appropriate restriction enzymes, and the nucleotide sequences of each of 16G11 and 18H6 were extended at the C terminus of the truncated E protein with sufficient and appropriate codon optimized nucleotide segment of the C terminus sequence of the E protein-encoding sequence of DEN-4 C15/full prM/full E protein-encoding nucleotide sequence (SEQ ID NO:234), and at the N terminus of the PRM15 with a sufficient and appropriate codon optimized nucleotide segment of the N terminus sequence of the C15/prM of DEN-4 C15/full prM/full E protein-encoding nucleotide sequence to make respective recombinant clones having C15/full length prM/full length E formats, as in Example 12. The resulting recombinant nucleic acids were named 16G11-25B10ext (SEQ ID NO:255) (i.e., 16G11 in PRM15/tE format extended to C15/full prM/full E format with appropriate nucleotide sequences of clone 25B10); 16G11-D4 (SEQ ID NO:254) (i.e., 16G11 in PRM15/tE format extended to C15/full prM/full E format with appropriate nucleotide sequences of WT Den-4); 18H6-25B10ext (SEQ ID NO:257) (i.e., 18H6 in PRM15/tE format extended to C15/full prM/full E format with appropriate sequences of clone 25B10); and 18H6-D4 (SEQ ID NO:256) (i.e., 18H6 in PRM15/tE format extended to C15/full prM/full E format with appropriate sequences of WT Den-4). These extended nucleic acid sequences were amplified and cloned into pMaxVax10.1 vectors, as described above. The corresponding recombinant polypeptide sequences for 16G11-25B10ext, 16G11-D4, 18H6-25B10ext, and 18H6-D4 are SEQ ID NOS:255, 254, 257, and 256, respectively. These amplified plasmids were analyzed in vivo by injection of mice and subsequent ELISA assays essentially as described in Example 15. The recombinant antigens of clones 25B10, 16G11-25B10ext, 16G11-D4, 18H6-25B10ext, and 18H6-D4 all induced antibodies in mice that strongly cross-reacted with all 4 dengue virus sero-types in ELISA assays (data not shown).

Example 22

This example illustrates the use of ELISpot assay (Enzyme-Linked ImmunoSPOT) to measure the number of T lymphocytes (T cells) secreting interferon gamma, as a marker of the T cell activating effects of polypeptides of the invention.

Subjects selected from an appropriate mammalian model for testing dengue virus vaccines (including, e.g., but not limited to, inbred mice, monkeys, or chimpanzees) or human patients (e.g., in a phase I clinical trial) are injected with a particular pMaxVax10.1 DNA plasmid vector (about 0.5-2 mg DNA plasmid vector in about 0.5-1 ml of an endotoxin-free carrier (e.g., PBS or another suitable saline buffer, 0.1M NaCl, or any suitable carrier which maintains the pH of the solution at about 7.4) injection volume for, e.g., humans or monkeys; plasmid amounts/compositions similar to those used in the above-described experiments for in vivo mice experiments are suitable for mice), wherein each vector comprises a particular recombinant antigen-encoding nucleotide sequence of the invention. For example, each subject is injected with a pMaxVax10.1 DNA plasmid vector that comprises one of the following: 1) a recombinant nucleotide sequence that encodes a PRM15/tE fusion protein (e.g., nucleotide sequences having SEQ ID NOS:157 (2G11), 158 (5/21), 159 (6E12), 185 (16B4), 187 (16G11), 172 (18E11), 200 (18H2), 235 (18H6)); 2) a recombinant nucleotide sequence that encodes a C15 signal peptide/full length prM protein/tE fusion protein (e.g., nucleotide sequences having SEQ ID NOS:254 (clone 16G11-D4), 255 (clone 16G11-25B10ext), 256 (clone 18H6-D4), 257 (clone 18H6-25B10ext)) (see, e.g., methods described in Examples 1 and 12); or another recombinant nucleotide sequence of the invention. Optionally, but preferably, initial DNA plasmid administration is followed by boosting with an immunogenic amount of the recombinant polypeptide encoded by the same nucleic acid included in the plasmid vector (delivered initially) or by re-administration of the same pMaxVax10.1 vector at about 14-28 days after initial DNA plasmid injection. Preferably, a second (polypeptide) protein boost (using the same recombinant polypeptide) and/or DNA plasmid vector boost (using the same plasmid vector that encodes the same recombinant polypeptide) is performed between 2-6 months after the initial infection. One group of control subjects receives no injection. Another group of control subjects receives injections of pMaxVax10.1_(null). Experiments are preferably performed with multiple subjects for each plasmid construct tested.

Sera are collected from the subjects at appropriate test times (e.g., about 60 days, 120 days, or 180 days after initial DNA plasmid injection, preferably including a time falling at least 2-4 weeks after the last recombinant polypeptide antigen/DNA vector boost) and T cells isolated by methods known in the art. For example, in human patients in a clinical study, peripheral blood mononuclear cells (PBMC) can be isolated from human blood by centrifugation over Histopaque-1077 (Sigma, St. Louis, Mo.) (using Ficoll gradient separation). T cells are isolated and purified either by staining the cells with anti-human CD2 monoclonal antibodies (mAbs) and sorting for CD2 positive (CD2⁺) cells using a FACS Vantage SE or removing cells that stained with mAbs specific for CD14, CD20, CD56 and CD94 by magnetic beads (e.g., Dynabeads, Dynal, Lake Success, N.Y.) following the manufacturer's instructions. Antibodies useful in such techniques can be purchased from Pharmigen (San Diego, Calif.). Magnetic separation of the T cells using Dynal Dynabeads is performed by first labeling PBMCs with pure monoclonal antibodies against CD14, CD20, CD56 and CD94, then labeling the cells with Sheep anti-mouse Dynabeads according to manufacturer's instructions. Non-T cells are removed by depleting with a magnet. The purity of the T cells should be 96-99% when analyzed by staining with anti-CD3 mAbs purchased from Pharmigen (San Diego, Calif.).

A 96 well ELISpot plate is coated with antibody (immunoglobulin (Ig)) that is specific for IFN-gamma. For example, NIB42 capture antibody (Pharmigen) can be used for plate coating. Other capture antibodies that can be used are known in the art. The antibody (Ig) binds to the nitrocellulose base of the ELISpot plate.

The activated T cells are transferred to the ELISpot plate, and IFN-gamma is released during the incubation period. IFN-gamma released locally around T cells is bound, and therefore is “captured” by the IFN-gamma specific antibody. The cells and any excess cytokines are washed off. A second antibody that is also specific for IFN-gamma is added. This antibody is coupled to an enzyme that is capable of converting a substrate into an insoluble colored product. For example, 4S.B3 biotinylated detection antibody (Pharmigen) can be used to quantify IFN-gamma production. Other detection antibodies that can be used are known in the art. The plates are washed once more, and the enzyme substrate is added. The substrate is converted into the insoluble product, forming spots of color that represent the areas of captured cytokines that were secreted by adjacent T cells. The colored spots are counted using a microscope or digital-imaging system such as an ELISpot reader (such as the ELISpot reader sold by Scanalytics, Inc.) by following manufacturer's instructions.

T cells obtained from sera of subjects that have been injected with a pMaxVax10.1 vector comprising a recombinant nucleic acid sequence of the invention exhibit a significantly higher level of IFN-gamma expression than T cells obtained from the sera of subjected injected with pMaxVax10.1_(null) or receiving no DNA injection.

This example illustrates the use of ELISpot to quantify the number of circulating antigen-specific CD8+ T cells. This example also demonstrates the antigen-specific increase in T cell activity associated with administering a DNA sequence encoding a recombinant polypeptide of the invention to a subject host, e.g., mammal.

Example 23

This example demonstrates ELISPot analysis of T cell associated IFN-gamma expression induced or modulated by recombinant polypeptides of the invention.

PBMCs are isolated from a mammal model or human and stimulated with one or more recombinant antigens of the invention, such as, e.g., but not limited to, SEQ ID NOS:66 (clone 2G11), 67 (5/21), 69 (6E12), 89 (16B4), 93 (clone 16G1), 108 (18E11), 109 (18H2), 110 (18H6), all of which are of the PRM15/tE format, or SEQ ID NOS:254 (clone 16G11-D4), 256 (16G11-25B10ext), 257 (18H6-D4), and/or 258 (18H6-25B10ext), all of which are of the C15/full prM/full E format, for about 6-24 hours and then mixed with enzymatically-tagged cytokine antibodies. The mixture is then chemically treated so that bound antibody-cell complexes (i.e., cytokine-secreting cells) are stained blue.

A suitable IFN-gamma ELISpot assay that can be used for quantitative IFN-gamma ELISPot analysis is commercially available through Biosource International, Camarrillo, Calif. The assay is performed according to the manufacturer's instructions and using standard techniques. For example, enumeration of IFN-gamma secreting cells in single cell suspension is performed by adding 50 μl of diluted coating antibody to each well followed by the addition of 50 μl of PBS. Each well is incubated overnight at 4° C. Samples are then aspirated and washed 5 to 10 times with wash buffer. 200 μl of post-coating solution is added into each well and wells are incubated 1 hour at 37° C. or overnight at 4° C. The wells are aspirated and not washed. Wells are 100 μl of pre-stimulated single cell preparation are added into the wells. The plate is covered with the plate cover and incubated for 5 hours at 37° C. in a humidified atmosphere containing 7% CO₂. The wells are aspirated, 200 μl ice-cold deionized water is added, and the plate is placed for 10 min on melting ice. The wells are washed 10 times with PBS. 100 μl of diluted biotinylated Antibody solution is added, the plate is covered and incubated for 1 hour at 37° C. or overnight at 4° C. The wells are aspirated and washed 5 to 10 times with PBS. 50 μl of diluted-labeled anti-biotin antibody solution (GABA) is added to each well. The plate is covered and incubated 1 hour at 37° C. The wells are aspirated and washed 5 to 10 times. 30 μl of activator solution is added to each well. The spot development is followed by light microscopy. When clear spots have developed, the reactions are stopped by rinsing the wells with distilled water. The results using recombinant clones in the PRM15/tE fusion protein format and C15/full length prM/full length E fusion protein format are compared with results obtained with subjects immunized with wild type dengue PRM15/tE fusion proteins and C15/full length prM/full length E fusion proteins. Results are also compared with subjects that were not immunized with any dengue antigen.

The results of such ELISpot analysis indicate that recombinant polypeptides of the invention are able to induce and/or modulate IFN-gamma expression from T cells. The present invention also provides alternative techniques for conducting ELISpot assays with recombinant polypeptides and/or polynucleotides of the invention.

Example 24

This example demonstrates a T-cell proliferation assay to measure the proliferation of a patient's T cells in response to stimulation with recombinant polypeptide antigens of the present invention.

T cells in a suitable subject (preferably a human patient) are stimulated with pMaxVax10.1 vectors as described in Example 22, followed by removal of a whole blood-based sample and separation of specific T-cell subsets. Monoclonal antibodies attached to paramagnetic particles recognize specific cell surface markers, and T cells are isolated when the sample is placed in a magnetic field. Several signs of T cell activation can be measured by such a technique. For example, increased intracellular ATP accumulation is associated with T-cell activation, so a luminescent ATP assay employing firefly luciferase can be used to measure the level of activation among isolated cells (an example of a commercially suitable assay in this respect is the Luminetics Assay, available through Cylex).

Additionally or alternatively, T cell proliferation is measured by ³H-thymidine incorporation: Briefly, 293 cells (or COS-7 cells or other cells of interest) are transfected with a pMaxVax10.1 plasmid vector encoding a recombinant dengue antigen of the invention, e.g., a recombinant tetravalent dengue polypeptide antigen (as described, e.g., in Example 22) or a control plasmid vector lacking an insert (e.g., pMaxVax10.1_(null) vector). The Effectine HTP (Qiagen, Valencia, Calif.) 96-well transfection method is used for plasmid transfections according to manufacturer's instructions.

Twenty-four hours after transfection, approximately 5×10⁴ purified T cells (purified as described, e.g., in Example 21) are cultured in triplicate in the presence of irradiated (5000 radians (rads)) transfectants and soluble anti-human CD3 mAbs (5 μg/mL) in a 96-well plate culture format (e.g., the system commercially available through VWR, Westchester, Pa.) at 37° C. in a humidified atmosphere containing 5% CO₂ in Yssel's medium supplemented with 10% FCS (200 μl/well) for a total of 3 days (72 hours). 1 microCurie (μCi)/well of ³H-thymidine (Amersham, Piscataway, N.J.) is added by pulsing to the cell cultures during the last 8 hours of the culture period, and the cells are harvested for counting onto filter paper by a cell harvester (Tomtec, Hamden, Conn.). ³H-thymidine uptake/incorporation in the cultured cells is determined by measuring the radioactivity on the dried filters using a MicroBeta scintillation counter (Wallac, Turku, Finland) according to the manufacturer's instructions. Proliferation of T cells is expressed as the mean counts per minute (cpm) of triplicate wells.

At least some of the recombinant polypeptide antigen-encoding plasmids of the invention (e.g., recombinant tetravalent polypeptide antigen-encoding plasmids of the invention) induce T cell responses and/or T cell proliferation at levels significantly higher than those seen in a plasmid control and/or at least as high a level as observed with plasmids encoding a WT dengue virus fusion protein having about the same length or substantially similar or substantially identical length (e.g., at least about 75%, 80%, 85%, 86%, 87%, 88% or 89%, preferably at least about 90%, 91%, 92%, 93%, or 94%, and more preferably at least about 95% (e.g., about 87-95%), 96% 97%, 98%, 99%, 99.5% identity in length) as (e.g., if the plasmid encodes a recombinant polypeptide antigen comprising a PRM15/tE fusion protein, a plasmid encoding a WT dengue polypeptide comprising a DEN 1, 2, 3, or 4 PRM15/tE fusion protein is used for comparison; if the plasmid encodes a recombinant polypeptide antigen comprising a C15/full length prM/full length E fusion protein, a plasmid encoding a WT dengue polypeptide comprising a DEN 1, 2, 3, or 4 C15/full length prM/full length E fusion protein is used for comparison). Plasmid vectors encoding a recombinant PRM15/tE-encoding sequence or recombinant C15/full length prM/full length E-encoding sequence that induce or promote T cell proliferation and/or activation at a level significantly above that induced or promoted by the plasmid vector control (or other control), and preferably at a level at least about similar or equivalent to that induced or promoted by a pMaxVax10.1 vector comprising a wild-type dengue virus PRM15/tE-encoding sequence or C15/full length prM/full length E-encoding sequence, respectively, can be identified and selected or isolated for further characterization (e.g., sequencing, effects in other T cell activity assays, and/or diversity analysis).

This experiment demonstrates assays suitable for assessing T-cell proliferation effects of recombinant polypeptide antigens of the invention. This experiment also demonstrates that at least some recombinant antigens of the invention induce or promote T cell proliferation and/or T cell activation effects in mammalian cells.

Example 25

This example illustrates the use of a cytotoxicity assay to quantify the amount of T cells expressing a recombinant dengue virus antigen of the invention in a particular sample.

Populations of 293 cells are transfected with a pMaxVax10.1 plasmid vector encoding a recombinant polypeptide antigen of the invention, with a pMaxVax10.1_(null) plasmid vector (control), or with nothing as described in Examples 22. The transfected cells are labeled with a radioactive isotope of chromium (⁵¹Cr). A subject's T cells (e.g., human T cells) that have been isolated and purified as described in Example 21 are obtained, mixed with the target cells, and the mixture is incubated for several hours. Lysis of antigen-expressing cells releases ⁵¹Cr into the medium. Recombinant antigen-specific lysis is calculated by comparing lysis of 293 cells expressing a recombinant antigen of the invention or control antigen in the presence or absence of the subject's effector cells. The results of such experiments are expressed as a percentage of cells exhibiting recombinant antigen-specific lysis. The percentage of cells exhibiting recombinant antigen-specific lysis in such experiments demonstrates the recombinant antigens of the invention are capable of inducing a significant cytotoxic immune response in a subject. This example provides a method of measuring the amount (number) of T cells expressing recombinant dengue virus specific antigens.

Example 26

This example illustrates the use of a tetramer assay to measure the amount (number) of CD8+ T-cells that recognize (and/or bind or selectively bind) a specific epitope in a recombinant polypeptide of the invention.

Putative MHC class I binding sequences occurring in recombinant polypeptides of the invention are identified by sequence analysis using a suitable epitope-identification algorithm or program, such as the BONSAI algorithm developed at Stanford University, the TEPITOPE algorithm, the SYFPEITHI program (which applies the algorithm of Rammensee et al.), the MAPPP program (available at “http” colon, slash, slash, “www” dot “mpiib-berlin” dot “mpg” dot “de” slash “MAPPP” slash), the PREDEP program (available at “http” colon, slash, slash, “bioinfo” dot “md” dot “huji” dot “ac” dot “il” slash “marg” slash “Teppred” slash “mhc-bind” slash), the ProPred program (“http” colon, slash, slash, “www” dot “imtech” dot “res” dot “in” slash “raghava” slash “propred”), and the BIMAS program (available at “http” colon, slash, slash, “bimas” dot “dcrt” dot “nih” dot “gov” slash “molbio”) which are variously described in, e.g., Altuvia et al., Mol. Immunol., 31, 1-19 (1994), Brusic et al., Nuc Acids Res., 22, 3663-3665 (1994), Hammer et al., J. Exp. Med., 180, 2353-2358 (1994), Parker et al., J. Immunol., 152, 163-175 (1994), Sturniolo et al. Adv. Immunol., 66, 67-100 (1997), and Cunha-Neto, Braz. J. Med. Biol. Res., 32(2), 199-205 (1999), or other suitable software to analyze the amino acid sequence of recombinant polypeptides of the invention. The amino acid sequences of test polypeptides also can be subjected to sequence analysis identify predicted T cell epitopes using software programs such as EpiPlot 1.0 (available at “http” colon, slash, slash, “genomics” dot “com” slash “software” slash “files” slash “epiplot1” dot “exe”), the EPIMER/EPIMAX algorithm developed at the Brown University School of Medicine (currently managed by EpiVax, Inc. (Providence, R.I.), or a sequence analysis algorithm such as the SOHHA algorithm (described in Reyes et al., Methods Enzymol 202:225-38 (1991)). In addition, information and software programs for identifying and/or predicting T cell epitopes using software programs in polypeptide sequences of the invention can be obtained at the following websites: “http” colon, slash, slash, “www” dot “epitope-informatics” dot “com”, “http” colon, slash, slash, “www” dot “vaccinome” dot “com” slash “index” dot “htm”, “http” colon, slash, slash, “syfpeithi” dot “bmi-heidelberg” dot “com” slash “scripts” slash “MHCServer” dot “dll” slash “home” dot “htm”, “http” colon, slash, slash, “www” dot “imtech” dot “res” dot “in” slash “raghava” slash “propred” slash “index” dot “html”, “http” colon, slash, slash, “bimas” dot “dcrt” dot “nih” dot “gov” slash “molbio” slash “hla bind” slash “http” colon, slash, slash, “epivax” dot “com” slash “epitope” dot “html”, and “http” colon, slash, slash, “www” dot “tepitope” dot “com”

Related techniques for identifying T cell and B cell epitopes are described in, e.g., International Patent Application WO 99/53058, Kammerer et al., Clin Exp Allergy 27:1015-1026 (1997), Sakakibara et al. J Vet Med Sci 60:599-605 (1998), Jiang et al., Protein Sci 9:403-416 (2000), Milik et al., Nat Biotech 16(8):753-756 (1998), Walshet et al., J Immunol Meth, 121:1275 (1989), and Schoofs et al. J Immunol 140:611-616 (1987).

T cells are isolated and purified as described in Example 22. The T cells are mixed with a polypeptide comprising a single predicted MHC I binding sequence or T cell epitope identified by, e.g., one of the above-described methods/analyses, joined to a class I HLA molecule (a molecule on the surface of CD8+ cells that helps display the epitope to the immune system)—more particularly, a synthetic tetrameric form of a fluorescently labeled MHC Class 1 molecule. As CD8+ T cells recognize antigen in the form of short peptides bound to MHC class 1 molecules, T cells with the appropriate T cell receptor bind to the labeled tetramers and can be quantified by flow cytometry as described in Example 2. T cells with receptors for a specific epitope without prior antigen stimulation are quantified. The results of such experiments are highly quantitative and sensitive.

At least some of the recombinant polypeptides contain epitopes that react with (or bind or specifically bind to) T cells in the above-described assay. The results of these experiments confirm the cytotoxic effects attendant at least some of the recombinant polypeptides of the invention.

This experiment provides a suitable assay for quantifying CD8+ T cells that react with (or bind or specifically bind to) a polypeptide of the invention. This experiment also provides a technique for identifying novel T cell epitopes in such polypeptides.

Example 27

This example illustrates a method of immunization of rhesus macaque monkeys with an effective amount of an immunogen-encoding or antigen-encoding nucleotide sequence of the invention sufficient to induce a protective immune response(s) in the monkeys against one or more (and preferably against two or more) dengue virus serotypes. Such an amount may be an immunogenic amount or an antigenic amount. DNA-based immunization methodologies using wild-type dengue virus antigens and rhesus macaque monkeys have been described (see, e.g., the methods described in Raviprakash et al., J. Gen. Virology 81:1659-1667 (2000), which is incorporated herein by reference in its entirety for all purposes), and such methods can alternatively be employed herein or adapted as desired.

RNA or DNA sequences of the invention can be used in the methods described below. In the studies below, an immunogen- or antigen-encoding DNA sequence is employed for each study. Each study is conducted according to the guidelines in “Guide for the Care and Use of Laboratory Animals, Institute of Laboratory Animal Resources,” National Research Council, DHHS Publication No. NIH-86-23 (1985). Animals are obtained from the Walter Reed Army Institute of Research animal facility in Forest Glen, Md.

Rhesus macaque monkeys (Macaca mulatta), aged approximately 7 to 24 years, are pre-tested for the presence of antibodies against dengue viruses of each of the four dengue virus serotypes by ELISA assay and plaque-reduction neutralization test. Those monkeys that do not show evidence of exposure to such dengue viruses (e.g., are sero-negative for such dengue viruses) are divided into several groups of three monkeys/group. Each monkey in a non-control group is inoculated intramuscularly (i.m.) or intradermally (i.d.) with a composition comprising an immunogenic amount or antigenic amount (e.g., about 0.1 μg to about 5 mg) of pMaxVax10.1 DNA plasmid vector comprising a nucleic acid sequence of the invention (as described in Example 22) in a carrier, such as, e.g., an endotoxin-free carrier, such as, e.g., a pharmaceutical carrier (e.g., PBS or another suitable saline buffer, 0.1M NaCl, or any suitable carrier which maintains the pH of the solution at about 7.4); the carrier optionally comprises additional excipients such as preservative agents. Alternatively, another plasmid vector comprising such nucleotide sequence of the invention can be used. Each monkey of a particular group is immunized with the same vector in the same concentration in the same manner.

In one study, a pMaxVax10.1 vector comprising a nucleic acid sequence encoding a recombinant PRM15/tE dengue virus antigen fusion protein is tested. In such study, e.g., each monkey in a group is administered with a pMaxVax10.1 vector comprising a nucleic acid sequence comprising a recombinant PRM15 gene and recombinant truncated envelope (E) gene. Such nucleic sequence comprises, e.g., one of the following: SEQ ID NOS:157 (clone 2G11), 158 (clone 5/21), 159 (clone 6E12), 185 (clone 16B4), 187 (clone 16G11), 200 (clone 18H2), 172 (clone 18E11), and 235 (clone 18H6).

In a second study, a pMaxVax10.1 vector comprises a nucleic acid sequence encoding a recombinant C15/full length prM/full length E dengue virus antigen fusion protein. In such study, e.g., each monkey in a group is administered with a vector comprising a recombinant C15 gene, recombinant full length prM gene and recombinant full length E gene. Such nucleic sequence comprises, e.g., one of the following: SEQ ID NOS:254 (clone 16G11-D4), 255 (clone 16G11-25B10ext), 256 (clone 18H6-D4), and 257 (clone 18H6-25B10ext).

As a control, each monkey of a control group of three monkeys is mock immunized with the carrier alone (with no plasmid vector). As a second control, each monkey of another group of three test monkeys is immunized with the same amount of pMaxVax10.1_(null) plasmid vector control (null vector) (e.g., for i.m. or i.d. injection, 1 mg of pMaxVax10.1_(null) plasmid vector is administered in the same total volume, in one injection or divided in several injections, as in administration of recombinant vectors).

While intramuscular and intradermal injection administrations are among the preferred routes of administration, vectors comprising the immunogen-encoding or antigen-encoding DNA or RNA sequences of the invention (and vector compositions thereof) can be administered by a variety of standard routes of administration as described herein, vector compositions can be any of those described herein, including, e.g., but not limited to, needle injection, parenteral administration, subcutaneous administration, gene gun or other biolistic delivery device, impression through the skin, oral delivery, inhalation, topical or transdermal delivery (e.g., using a transdermal patch or ointment).

The total amount of recombinant plasmid vector administered depends upon the manner of administration. For administration via needle injection, about 0.3 mg to about 2 mg, usually about 1 mg, of such DNA plasmid vector is typically administered. For administration via gene gun, about 0.1 μg to about 100 μg plasmid vector, typically from about 1 μg to about 10 μg plasmid vector, are administered.

The total volume of the plasmid vector composition for each immunization typically depends upon the amount or dose of DNA vector (in mg) to be administered. For 1 mg DNA vector administered via injection, the total volume is typically about 0.5 ml. The total volume of the vector composition can be administered in one administration or divided into several smaller volumes administered in several administrations consecutively, sequentially, or simultaneously, at one or more sites in the animal.

For example, for intramuscular administration, 0.5 ml total volume of DNA plasmid vector composition is typically administered to a monkey in one injection or two injections in a muscle of the monkey, such as, e.g., the tibialis anterior muscle. For intradermal administration, e.g., 0.5 ml total volume of DNA plasmid vector composition is typically administered to a monkey (e.g., anterior thoracic dermal area of monkey) in five administrations of 0.1 ml each. One of skill in the art can employ and/or adapt other administration formats, volumes, and compositions known in the art.

The DNA plasmid vector is optionally administered with one or more transfection promoting agents (including, e.g., those described herein), one or more immunostimulatory sequences (including, e.g., nucleotide sequences comprising CpG islands or unmethylated CpG motifs, termed ISS motifs) (see Krieg, Trends in Microbiol 7:64-65 (1999)), one or more cytokines (e.g., GM-CSF), one or more adjuvants (e.g., LAMP, alum, other known adjuvants), and/or one or more costimulatory molecule-encoding sequences. Such costimulatory molecule-encoding sequences may include, but are not limited to, e.g., at least one mammalian B7-1-encoding nucleotide sequence or B7-1 homolog-encoding nucleotide sequence, at least one mammalian B7-2-encoding nucleotide sequence or B7-2 homolog-encoding nucleotide sequence, or at least one novel co-stimulatory polypeptide-encoding nucleotide sequence described in International Patent Application WO 02/00717, filed Jun. 22, 2001 (including e.g., at least nucleotide sequence encoding, e.g., a CD28 binding protein (“CD28BP”), such as CD28BP-15), or any combination thereof. Adjuvants can be delivered to a subject with one or more recombinant dengue polypeptides of the invention. In one format, a DNA vector comprising a sequence that encodes a recombinant antigen polypeptide and an intracellular targeting adjuvant (e.g., LAMP protein sequence) and/or lysosomal associated membrane protein can be used for delivery to a subject.

The immunizations (e.g., vaccinations) are repeated at least twice after the initial inoculation with the identical vector composition in the same concentration (e.g., 1 mg vector/0.5 ml total volume). Each subsequent inoculation or immunization is often termed a “boost.” The second inoculation (i.e., first “boost”) is administered two weeks or one month after initial inoculation. The third inoculation (i.e., second “boost”) is typically administered at about 2 to about 6 months following the initial inoculation (most typically at about 4, about 5, or about 6 months after the initial inoculation). Those monkeys receiving intradermal inoculation(s) can be inoculated a fourth time with the same vector in the same amount at 12, 14 or 18 months after the initial inoculation. Groups of control animals are administered with an “empty” plasmid vector (e.g., pMaxVax10.1_(null)) in the same concentration (e.g., 1 mg/0.5 ml) or a carrier (e.g., pharmaceutical carrier) in the same volume (e.g., 0.5 ml) alone during the first (initial), second, third, and fourth inoculations.

Alternatively, for another group(s) of monkeys, one or more recombinant polypeptides of the invention can be used for the inoculation or immunization boosts following the initial inoculation as described in Example 28. The effectiveness of one or more boosts using a polypeptide antigen of the invention versus an immunogen-encoding or antigen-encoding DNA or RNA sequence of the invention is compared. Such polypeptide antigens can be administered by a variety of standard routes of administration as described herein, including, e.g., parenteral, intramuscular and intradermal delivery (see Example 28).

While intramuscular and intradermal injection administrations are among the preferred routes of administration, vectors comprising the immunogen-encoding or antigen-encoding DNA or RNA sequences of the invention can also be administered by a variety of standard routes of administration as described herein, including, e.g., parenteral administration, subcutaneous administration, or by biolistic injection (e.g., a gene gun).

Sera are obtained from each monkey (e.g., by bleeding each monkey daily for 5 to 10 days). The sera are tested for the presence of antibodies specific for each dengue virus serotype (DEN-1, DEN-2, DEN-3, and DEN-4) by ELISA assay and/or PRNT assay as described previously. The ability of sera of inoculated monkeys to neutralize each of the four dengue virus serotypes is determined by PRNT assay. A negative control for the PRNT assays consists of a pool of the sera obtained from monkeys prior to being immunized A “prime” is typically defined as the first immunization.

The effectiveness of the prophylactic administration of recombinant polypeptide-encoding nucleotide acids of the invention in such amount(s) is assessed by challenge of the immunized monkeys with one or more dengue viruses, preferably of multiple dengue virus serotypes, and observation of the occurrence of viremia (including, e.g., onset, levels, and duration of viremia) or nonoccurrence of viremia in such monkeys thereafter.

Groups of monkeys are selected for in vivo challenge with one or more live dengue viruses of the four serotypes. For one challenge study, monkeys are challenged with a dengue virus at about 9 or at about 12 months after the initial immunization. For a second challenge study, monkeys are challenged with a dengue virus at about 15 or at about 18 months after the initial immunization. For each challenge study, a control group of monkeys is also maintained and observed for comparison. Serum is obtained by bleeding from each monkey prior to the viral challenge. Then, each monkey is inoculated via, e.g., subcutaneous injection in a leg or arm with about 0.5 ml of a solution comprising about 1×10² to about 5× plaque forming units (p.f.u.) (typically about 1×10² to about 1.5×10⁴p.f.u., 1×10² to about 1.25×10⁴ p.f.u., about 1×10³ p.f.u. or about 1.25×10⁴ p.f.u.) of a live dengue virus of one of the four serotypes. At least two monkeys are injected with a dengue virus of one serotype; all dengue virus serotypes are employed in each challenge study.

Sera are obtained by bleeding the inoculated and control monkeys at about 15 and about 30 days after initial challenge; alternatively, sera can be obtained at other days following initial challenge. Viremia and antibodies specific for each dengue virus serotype are measured in the sera. Viremia is measured by following the procedure set forth in Raviprakash et al., J. Gen. Virology 81:1659-1667 (2000). Antibody response is measured as described previously or by methods known in the art (see, e.g., Raviprakash et al., J. Gen. Virology 81:1659-1667 (2000)). Responses in monkeys immunized with a plasmid vector encoding a recombinant antigen of the invention are compared with monkeys injected with the null vector or with carrier alone.

For those monkeys inoculated with a plasmid vector comprising a recombinant dengue virus antigen-encoding nucleic acid of the invention that survive challenge with a particular dengue virus, such recombinant dengue virus antigen-encoding nucleic acid induces a protective immune response against infection by such dengue virus. Protection may be full or partial protection. Full protection is observed in those immunized monkeys that are completely protected from developing viremia; partial protection is observed in those immunized monkeys that only develop a reduced viremia compared to control monkeys.

The present invention includes recombinant nucleic acids that encode recombinant PRM15/tE dengue virus antigen fusion proteins and/or recombinant C15/full length prM/full length E dengue virus antigen fusion proteins (and vectors thereof) that induce protective immune response(s) in vivo against dengue virus infection by at least one dengue virus serotype, preferably against at least two dengue virus serotypes, more preferably against at least three or at least four dengue virus serotypes, following in vivo challenge by one or more dengue virus serotypes in a non-human primate model, e.g., rhesus macaque monkey model.

In another aspect, the invention includes recombinant nucleic acids that encode recombinant tE dengue virus antigen fusion proteins and/or recombinant full length prM/full length E dengue virus antigen fusion proteins (and vectors thereof) that induce protective immune response(s) in vivo against dengue virus infection by at least one dengue virus serotype, preferably against at least two dengue virus serotypes, more preferably against at least three or at least four dengue virus serotypes, following in vivo challenge by one or more dengue virus serotypes, in a non-human primate model. In such aspect, the encoded recombinant polypeptide does not include the PRM15 signal peptide or C15 signal peptide.

The invention also includes neutralizing antibodies that protect against dengue virus infection by one of the four dengue virus serotypes. Such antibodies are produced in response to administration to a subject of a recombinant nucleic acid that encodes a recombinant PRM15/tE dengue virus antigen fusion protein or a recombinant C15/full length prM/full length E dengue virus antigen fusion protein and can be isolated by standard methods from the sera of such subjects. Vectors comprising such nucleic acids may be administered to the subject.

Example 28

This example illustrates a method of immunization of rhesus macaque monkeys with an effective amount of one or more recombinant dengue virus polypeptide antigens of the invention sufficient to induce a protective immune response(s) in the monkeys against one or more (and preferably against two or more) dengue viruses. Such an amount may be an immunogenic amount or an antigenic amount.

Immunization methodologies using dengue virus antigens and primates have been described, and such methods can alternatively be employed herein or adapted as desired. See, e.g., Eckels et al., Amer J Medicine and Hygiene (1994) 50:472-487; Raviprakash et al, J. Gen. Virol. (2000) 81: 1659-1667).

A recombinant polypeptide of the invention can be used in the methods described below. Each study is conducted according to the guidelines in “Guide for the Care and Use of Laboratory Animals, Institute of Laboratory Animal Resources,” National Research Council, DHHS Publication No. NIH-86-23 (1985). Animals are obtained from the Walter Reed Army Institute of Research animal facility in Forest Glen, Md.

Rhesus macaque monkeys (Macaca mulatta), aged approximately 7 to 24 years, are pre-tested for the presence of antibodies against dengue viruses of each of the four dengue virus serotypes by ELISA assay and plaque-reduction neutralization test. Those monkeys that do not show evidence of exposure to such dengue viruses (e.g., are sero-negative for such dengue viruses) are divided into several groups of three monkeys/group. Each monkey in a non-control group is inoculated intramuscularly (i.m.) or intradermally (i.d.) on day 0 with a composition comprising an immunogenic amount or antigenic amount (e.g., about 0.1 μg-1 mg, about 0.5-500 μg, about 0.5-100 μg, about 1-50 μg, about 1-10 μg, about 5-30 μg, about 15-25 μg, or about 0.5 μg to about 2 μg) of a recombinant polypeptide of the invention in a carrier (as described in Example 22). The carrier may comprise, e.g., an endotoxin-free carrier, such as, e.g., a pharmaceutical carrier (e.g., PBS or another suitable saline buffer, 0.1M NaCl, or any suitable carrier which maintains the pH of the solution at about 7.4) and may optionally comprise additional excipients such as preservative agents. The recombinant polypeptide may be administered with an adjuvant (e.g., Freund's incomplete adjuvant (FIA) or aluminum hydroxide) and/or liposome(s) in an amount such as is typically delivered with a DNA vector or DNA vaccine. One of skill can determine the type of adjuvant or liposome and the amount to be included. One group of monkeys serves as the control groups; monkeys of the control group are mock immunized with PBS or another suitable saline solution.

Alternatively, a monkey is administered with a composition comprising a carrier (e.g., pharmaceutically acceptable carrier) and a nucleic acid vector encoding a recombinant polypeptide of the invention. A sufficient amount of the composition is administered such that an immunogenic amount or antigenic amount of the encoded polypeptide is produced (e.g., from about 50 μg to about 10 mg, about 100 μg to about 5 mg, about 1-5 mg, or from about 2-5 mg nucleic acid is administered). Typically, the carrier is PBS. At least one adjuvants and/or liposome may be included in the composition in an amount as is typically delivered with a DNA vector or DNA vaccine. One of skill could readily determine the type of adjuvant or liposome and the amount to be included.

In one study, a group(s) of monkeys is administered with at least one recombinant dengue virus polypeptide of the invention comprising a PRM15/tE dengue virus antigen fusion protein (or combination of such polypeptides). For example, such polypeptide(s) may comprise one or more of the following sequences: SEQ ID NOS:66 (clone 2G11), 67 (clone 5/21), 69 (clone 6E12), 89 (clone 16B4), 93 (clone 16G11), 109 (clone 18H2), 108 (clone 18E11), and 110 (clone 18H6). Alternatively, the recombinant polypeptide that is administered comprises a recombinant tE dengue virus polypeptide antigen and does not include a PRM15 signal peptide.

In a second study, a group(s) of monkeys is administered with at least one recombinant dengue virus polypeptide of the invention comprising a recombinant C15 signal peptide/full length prM/full length E dengue virus antigen fusion protein. Such polypeptide comprises, e.g., one of the following sequences: SEQ ID NOS:250 (clone 16G11-D4), 251 (clone 16G11-25B10ext), 252 (clone 18H6-D4), and 253 (clone 18H6-25B10ext). Alternatively, the recombinant polypeptide that is administered comprises a recombinant full length prM/full length E dengue virus antigen fusion protein and does not include a C15 signal peptide. As will be understood by those of skill, other polypeptides of the invention may be employed in these methods. Preferably, a recombinant tetravalent dengue virus polypeptide antigen of the invention is employed.

As described in Example 27, such recombinant dengue virus polypeptide can be optionally be administered with one or more transfection promoting agents (including, e.g., known agents and those described herein), one or more immunostimulatory sequences (e.g., nucleotide sequences comprising CpG islands or unmethylated CpG motifs, ISS motifs), one or more cytokines (e.g., GM-CSF), one or more adjuvants, and/or one or more costimulatory molecule polypeptides (e.g., B7-1 polypeptide, B7-2 polypeptide, a novel co-stimulatory polypeptide described in International Patent Application WO 02/00717, filed Jun. 22, 2001 (including e.g., CD28 binding protein, such as CD28BP-15), or a combination thereof, as well as a carrier(s) or excipient(s), such as, e.g., a preservative agent(s).

The manner of administration of the polypeptide compositions can be any of those described herein, including, e.g., but not limited to, needle injection, impression through the skin, subcutaneous administration, parenteral administration, oral delivery, inhalation, or topical or transdermal delivery (e.g., using a transdermal patch or ointment). The total amount of recombinant polypeptide administered depends upon the manner of administration. For administration via needle injection, about 0.1 μg-1 mg, about 0.5-500 μg, about 0.5-100 μg, about 1-50 μg, about 1-10 μg, about 5-30 μg, about 15-25 μg, or about 0.5 μg to about 2 μg of such recombinant polypeptide can be administered. One of skill in the art can readily determine the amount of polypeptide preferable for other routes of administration.

The total volume of the polypeptide composition for each immunization typically depends upon the amount or dose of polypeptide (in mg) to be administered as described in Example 27. The total volume of the composition can be administered in one administration or divided into several smaller volumes administered in several administrations consecutively, sequentially, or simultaneously, at one or more sites in the animal. For example, for intramuscular administration, the total volume of the polypeptide composition may be administered to a monkey in one injection or two injections in a muscle of the monkey, such as, e.g., the tibialis anterior muscle. For intradermal administration, a similar total volume of polypeptide composition can be divided into smaller, equal volumes and delivered to a monkey (e.g., anterior thoracic dermal area of monkey) in multiple administrations. One of skill in the art can employ and/or adapt other administration formats, volumes, and compositions known in the art.

The immunizations (e.g., vaccinations) are repeated at least about two times after the initial inoculation with identical polypeptide in the same amount and concentration. Each subsequent inoculation or immunization with the polypeptide is termed a “boost.” The second inoculation or immunization (e.g., the first “boost” following the initial inoculation) is administered about 2 weeks to 1 month after the initial inoculation, and a third inoculation (e.g., the second “boost”) is administered about 1 to about 6 months following the second inoculation (most typically from about 3 months, 4 months, or about 6 months). Control animals are administered with a carrier (e.g., pharmaceutical carrier) alone in the same volume during the second and third inoculations following the initial inoculation.

Alternatively, for another group(s) of monkeys, one or more DNA or RNA sequences of the invention (as described above) can be used for one or more of the inoculation boosts following the initial inoculation, as described in Example 27. The effectiveness of boosts using a polypeptide antigen of the invention versus an immunogen-encoding or antigen-encoding DNA or RNA sequence of the invention is compared.

Following the procedures set forth in Example 27, sera are obtained from each monkey, and such sera are tested for the presence of antibodies specific for each dengue virus serotype (DEN-1, DEN-2, DEN-3, and DEN-4) by ELISA assay and/or PRNT assay. The ability of sera of inoculated monkeys to neutralize each of the four dengue virus serotypes is determined by PRNT assay.

Effectiveness of the prophylactic administration of such recombinant polypeptide(s) of the invention is assessed by challenge of the immunized monkeys with one or more dengue viruses, preferably of multiple dengue virus serotypes, and observation of the occurrence of viremia (including, e.g., onset, levels, and duration of viremia) or nonoccurrence of viremia in such monkeys thereafter, as described in Example 27.

The administration of an effective amount (e.g., immunogenic amount or antigenic amount) of one or more recombinant polypeptides of the invention to a nonhuman primate induces a protective immune response against in vivo challenge by at least one dengue virus. Protection may be full or partial protection. Full protection is observed in those immunized monkeys that are completely protected from developing viremia; partial protection is observed in those immunized monkeys that only develop a reduced viremia compared to control monkeys.

The present invention includes recombinant PRM15/tE dengue virus antigen fusion proteins and recombinant C15/full length prM/full length E dengue virus antigen fusion proteins (and compositions thereof) that induce protective immune response(s) in vivo against dengue virus infection by at least one dengue virus serotype, preferably at least two dengue virus serotypes, more preferably at least three or at least four dengue virus serotypes, following in vivo viral challenge by one or more dengue virus serotypes, in a non-human primate model, e.g., rhesus macaque monkey model.

In another aspect, the invention also includes recombinant truncated E dengue virus antigen fusion proteins and recombinant full length prM/full length E dengue virus antigen fusion proteins (and compositions thereof) that induce protective response(s) in vivo against dengue virus infection by at least one dengue virus serotype, preferably at least two dengue virus serotypes, more preferably at least three or at least four dengue virus serotypes, following in vivo viral challenge by one or more dengue virus serotypes, in a non-human primate model. In such aspect, the encoded recombinant polypeptide does not include the PRM15 signal peptide or C15 signal peptide.

Example 29

This example illustrates a method of immunization of a human with a composition comprising a pharmaceutically acceptable carrier (e.g., PBS) and at least one recombinant immunogen-encoding or antigen-encoding DNA or RNA sequence of the invention or with at least one human codon optimized DNA or RNA sequence of the invention described herein. For example, any recombinant PRM15/tE fusion protein-encoding nucleotide sequence, including, e.g., but not limited to, any of SEQ ID NOS:157 (clone 2G11), 158 (clone 5/21), 159 (clone 6E12), 185 (clone 16B4), 187 (clone 16G11), 200 (clone 18H2), 172 (clone 18E11), 235 (clone 18H6), or any composition or mixture of such nucleotide sequences, or any recombinant C15/full length prM/full length E fusion protein-encoding nucleotide sequence, including, e.g., but not limited to, any of SEQ ID NOS:254 (clone 16G11-D4), 255 (clone 16G11-25B10ext), 256 (clone 18H6-D4), 257 (clone 18H6-25B10ext), 204 (2G11-D4), and 202 (6E12-D4), or any mixture of such nucleotide sequences, can be employed in this method. In another aspect, a composition comprising a pharmaceutically acceptable carrier and at least one recombinant PRM15/tE fusion protein-encoding nucleotide sequence and at least one recombinant C15/full length prM/full length E fusion protein-encoding nucleotide sequence can be employed (e.g., 18H6 (SEQ ID NO:235), 2G11-D4 (SEQ ID NO:204), and 6E12-D4 (SEQ ID NO:202). Other immune-stimulating polynucleotides of the invention or combinations of such polynucleotides can also be used in this method.

Immunization of a human with one or more recombinant DNA or RNA sequences of the invention is done using a method similar to that described in Example 27. Prophylactic administration of an effective amount (e.g., immunogenic amount or antigenic amount) of one or more such DNA or RNA sequences of the invention to a human results in the induction or enhancement of an immune response against at least one, and preferably at least two, at least three, or at least four, dengue viruses in the human. For such polynucleotides of the invention, a protective immune response against one or more dengue viruses is induced or enhanced in the human recipient. Such response may be a partially protective response, but is preferably a fully protective response. Such polynucleotide or combination of several different recombinant polynucleotides serves as a DNA vaccine that induces an immune response(s) that protects against infection by at least one, preferably at least two, and more preferably at least three or four dengue virus serotypes.

In addition, the invention provides methods of prophylactic immunization of humans with one or more recombinant dengue virus polypeptides of the invention in, e.g., a suitable carrier, such as a pharmaceutically acceptable carrier (PBS), and optionally any acceptable adjuvant, such as alum. Immunization of a human with one or more recombinant polypeptides of the invention is conducted using a method similar to that described in Example 27. Any recombinant polypeptide of the invention or combination of such polypeptides can be used for human immunization; for example, such polypeptide may comprise one or more of the following sequences: SEQ ID NOS:66 (clone 2G11), 67 (clone 5/21), 69 (clone 6E12), 89 (clone 16B4), 93 (clone 16G11), 109 (clone 18H2), 108 (clone 18E11), and 110 (clone 18H6) or 251 (16G1-25B10). Alternatively, the recombinant polypeptide that is administered comprises a recombinant tE dengue virus polypeptide antigen and does not include a PRM15 signal peptide. In another aspect, a combination of two or more any of the recombinant polypeptides of the invention can be administered.

Such polypeptides can be administered to a human in addition to a polynucleotide or, alternatively, in place of such polynucleotide. The recombinant polypeptide is administered to the human in an amount effective to induce or enhance an immune response(s) in the human. The effective amount comprises, e.g., immunogenic amount or antigenic amount. Suggested doses were described above. Such antigenic polypeptides may serve as protein vaccines. Administration of the polypeptide can be via a variety of routes, as described above; transdermal and intramuscular administration via injection are among the typical routes of polypeptide delivery.

Amounts of recombinant dengue virus polynucleotides and polypeptides used for the immunization of rhesus macaque monkeys in Examples 27 and 28 can be similarly used for the immunization of humans; alternatively, such amounts can be modified proportionally to take into account the size of the human relative to that of the macaque monkey. For example, the dose of dengue virus peptide antigen for immunization of a human may be slightly higher (e.g., about 0.5 μg-1 mg, about 0.5-500 μg, about 0.5-100 μg, about 1-50 μg, about 1-10 μg, about 5-30 μg, about 15-25 μg) compared to the immunization dose used for a monkey. The prophylactic administration of an effective amount of one or more recombinant dengue virus polypeptide antigens of the invention induces in a human a protective immune response against at least one, preferably at least two, and more preferably at least three or at least four, dengue viruses.

Such polypeptides and polynucleotides of the invention can be formulated as a composition with a carrier or excipient, including as a pharmaceutical composition, as described herein, and can be administered to a human by a variety of routes as described herein (see also Examples 27 and 28). Such polynucleotides can be delivered via a vector as described herein (see, e.g., Example 27). The recombinant dengue virus polypeptide of the invention can be administered alone (e.g., as a protein vaccine) or in combination (before or after) with administration of a plasmid vector comprising the corresponding polypeptide-encoding polynucleotide of the invention. Other formats are set forth in Example 28.

Similarly, the polynucleotide can be administered alone via a plasmid vector (e.g., as a DNA vaccine) or in combination (before or after) with administration of the corresponding encoded polypeptide of the invention, and a variety of boosting strategies and methodologies can also be employed, as described in, e.g., in Examples 27 and 28. For example, at least one such recombinant polynucleotide (e.g., recombinant DNA vaccine) or at least such recombinant polypeptide (e.g., recombinant protein vaccine) can be administered to a human initially in an amount effective to induce an immune response, and after an appropriate time period (which can be determined by one of skill in the art) (e.g., about one week, about one month, about 4, about 5, about 6, about 12, or about 18 months, etc.), a “boost” immunization comprising the same or a different amount of the polynucleotide (DNA vaccine) or polypeptide (protein vaccine) can be administered to the same human. If desired, multiple boosts can be administered to the human.

In another aspect, the invention provides methods comprising the administration to a human of an effective amount of a composition comprising at least one nucleic acid encoding at least one dengue virus antigen of the invention in combination with one or more nucleic acids encoding one or more WT dengue virus prM/E fusion proteins, WT dengue virus PRM15/tE fusion proteins, and/or WT C15/full length prM/full length E fusion proteins, and, optionally, a nucleotide encoding a co-stimulatory molecule or cytokine, such as GM-CSF or an interferon, or combination of such co-stimulatory molecule-encoding nucleotides and/or cytokine-encoding nucleotides, wherein the amount is sufficient to induce a protective immune response against at least one, at least two, and preferably at least three or at least four dengue virus serotypes.

In yet another aspect, the invention provides methods comprising the administration to a human of an effective amount of a composition comprising at least one recombinant dengue virus antigen of the invention in combination with one or more WT dengue virus prM/E fusion proteins, WT dengue virus PRM15/tE fusion proteins, and/or WT C15/full length prM/full length E fusion proteins, and, optionally, one or more co-stimulatory molecules or cytokines, such as GM-CSF or an interferon, or combination of such co-stimulatory molecules and/or cytokines, wherein the amount is sufficient to induce a protective immune response against at least one, at least two, and preferably at least three or at least four dengue virus serotypes.

In all such methods and formats described herein, the resulting induced immune response(s) induced in the human who received the one or more immunizations of a polynucleotide(s) and/or polypeptide(s) can be determined at various times, including before and/or after the various immunizations, by measuring, e.g., the antibody titer level in serum (blood) obtained from the human using ELISA assays and/or PRNT assays described above.

The sequence listing is shown below.

SEQ NO. NAME SEQUENCE 1. 2/7-NPRM MRCIGISNRDPVEGVSGATWVDVVLEHGSCVTTMAKNKPTLDIELIKTEATQPATLRK “NPRM” = YCIEAKLTNTTTESRCPTQGEPYLKEEQDQNYVCKHTYVDRGWGNGCGLPGKGSLITC no prM AKFKCVTKLEGKIVQYENLKYSVIVTVHTGDQHQVGNETTEHGTTATITPQAPTSETQ seq LTDYGALTLDCSPRTGLDFNRVVLLTMKKKTWLVHKQWFLDLPLPWTAGASTSQETWN RKELLVTFKNAHAKKQEVVVLGSQEGAMHTALTCATEVDSGDGNLLFTGHLKCRLKMD KLQLKGMSYSMCTGKFQIVKEIAETQHGTIVIRVQYEGDGAPCKTPFSTEDGQGKAHN GRLITANPIVIDKEKPVNTELEPPFGDSYIVVGAGDKALKTNWYKKGSSIGKMFESTY RGAKRMAILGETAWDFGSVGGVFTSIGKALHQVFGATYGA 2. 2G11-NPRM MRCVGIGNRDFVEGLSGATWVDVVLEHGGCVTTMAKNKPTLDFELIKTTAKEVALLRT (no prM YCIEASISNTTTATRCPTQGEPYLKEEQDQQYICRRDVVDRGWGNGCGLFGKGSLTTC seq) AKFKGVTKLEGNIVQPENLEYTIVITPHTGDQHQVGNDTQGVTVEITPQASTVEATLP EYGTLGLECSPRTGLDFNENILLTMKNKAWMVHRQWFFDLPLPWTSGATTETPTWNRK ELLVTFKNAHAKKQEVVVLGSQEGAMHSALAGATEIQMSSGNLLFTGHLKCRLKMDKL QLKGMSYTMCTGKFQTVKETAETQRGTTLIKVEYKGEDAPCKIPFSTEDEKGVTQNGR LTTANPIVTEKDSPVNTEAEPPFGESYTVVGAGEKALKLSWFKKGSSIGQMFESTYRG AKPPIAILGETAWDFGSVGGLLTSLGKAVHQVFGSVYTTMFG 3. 5/21-NPRLM MRCVGVGNRDFVEGLSGATWVDVVLEHGSGVTTMAKNKPTLDTELLKTEVTNPATLRK (no prM YCIEAKITNITTDSRCPTQGEAILPEEQDQNYVCKHTYVDRGWGNGCGLFGKGSLVTC seq) AKFTCKKNMEGNIVQPENLEYTIVITPHSGEEHAVGNDTGKHGKEIKITPQSSITEAE LTGYGTVTMECSPRTGLDFNRVVLLTMKKKSWLVHKQWFLDLPLPWTAGADTSEVHWN HKERMVTFKVPHAKRQDVTVLGSQEGAMHSALAGATEIQMSSGNLLFTGHLKCRLRMD KLQLKGMSYVMCTGKFQIVKEIAETQHGTIVIRVQYEGDGSPCKIPLEIMDLEKRHVL GRLITVNPIVIDKEKPVNIEAEPPFGESYIVVGVGDSALTLHWFRKGSSIGQMFETTM RGAKRMAILGDTAWDFGSLGGVFTSIGKALHQVPGATYGA 4. 6C6-NPRM MRCVGIGNRDFVEGVSGGAWVDLVLEHGGCVTTMAKNKPTLDFELIKTTAKEvALLRT (no prM YCIEASISNITTATRCPTQGEPYLKEEQDQQYTCRRDVVDRGWGNGCGLFGKGGIVTC seq) AMFTCKKNMEGNTVQPENLEYTIVTTPHSGEEHAVGNDTGKHGKEIKITPQSSITEAE LTGYGTVTMECSPRTGLDFNEMILMKMKNKAWMVHRQWFLDLPLPWTAGADTETPTWN RKELLVTFKNAHAKKQEVVVLGSQEGAMHSALAGATEVDSGDGNHMFAGHLKCKVRMD KLQLKGMSYSMCTGKFQTVKETAETQHGTIVIRVQYEGDGSPCKTPLEIMDLEKRHVL GRLTTVNPIVTEKDSPVNIEAEPPFGDSYIITGVEPGQLKLHWFKKGSSIGKMFEATA RGARRMAILCDTAWDFGSAYT 5. 6E1 2-NPRM MRCVGVGNRDFVEGLSGATWVDVVLEHGSCVTTMAKNKPTLDIELLKTEVTNPAVLRK (no prM LCIEASISNITTATRCPTQGEAILPEEQDQNYVCKHTYVDRGWGNGCGLFGKGSVVTC seq) AKFQCLEPIEGKVVQHENLKYTVIITVHTGDQHQVGNDTQGVTVEITPQASTVEAILP EYGTLGLECSPRTGLDFNEMILLTMKNKAWMVHRQWFFDLPLPWTAGADTSEVHWNHK ERMVTFKTAHAKKQDVVVLGSQEGAMHTALTGATEIQMSSGTTTIFAGHLKCKVRMDK LTLKGVSYVMCTGSFKLEKEVAETQHGTVLVQVKYEGTDAPCKIPFSSQDEKGVTQNG RLITANPIVIDKEKPVNIEAEPPFGDSYIIIGVEPGQLKLHWFKKGSSIGQMFETTMR GAKRMAILGDTAWDFGSVGGLLTSLGKMVHQIFGSVYTTMFG 6. 6F4-NPRM MRCVGIGNRDFVEGVSGGAWVDLVLEHGGCVTTMAKNKPTLDFELIKTEATQPATLRK (no prM YCIEAKLTNTTTESRCPTQGEAILPEEQDQQYICRRDVVDRGWGNGCGLFGKGSLITC seq) AKFQCLEPIEGKVVQHENLKYTVIITVHTGDQHQVGNDTQGVTVEITPQASTVEAILP EYGTLGLECSPRTGLDFNEMILLTMKKKSWLVHKQWFLDLPLPWTAGADTSEVHWNHK ERVTFKVPHAKRQDVTVLGSQEGAMHSALAGATEVDSGDGNHMFAGHLKCKXTPIEKL RIKGMSYSMCTGKFQIVKEIAETQHGTIVIRVQYEGDGSPCKIPLEIMDLEKRHVLGR LITVNPIVTEKDSPVNIEAEPPFGDSYIIIGVEPGQLKLHWFKKGSSIGQMFEATARG ARRMAILGDTAWDFGSVGGVFTSIGKALHQVFGAIYGA 7. 7A9-NPPPI MRCVGIGNRDFVEGVSGATWVDVVLEHGGCVTTMAKNKPTLDIELQKTEATQLATLRK (no prM LCIEASISNITTATRCPTQGEATLVEEQDTNFVCRRTFVDRCWGNGCGLFGKGSLVTC seq) AKFQCLEPIEGKVVQHENLKYTIVITPHSGEEHAVGNDTGKHGKEIKITPQSSITEAE LTGYGTVTMECSPRTGLDFNEMILLTMKSKTWLVHKQWFLDLPLPWTAGADTSEVHWN HKERMVTFKNAHAKKQEVVVLGSQEGAIHTALTGTTEIQNSGNLLFTGHLKCRLRMDK LQLKGMSYSMCTGKFQIVKEIAETQHGTIVIRVQYEGDGSPCKIPLEIMDLEKRHVLG RLITVNPIVTEKDSPVNIEAEPPFGDSYIITGVEPGQLKLHWFRKGSSIGKMFESTYR GAKRMAILGETAWDFGSVGGLLTSLGKAVHQVFGSVYTTMFG 8. 11B1-NPRM MRCTGISNRDFVEGVSGATWVDVVLEHGSCVTTMAKNKPTLDTELIKTEATQPATLRK (no prM YCIEAKLTNTTTATRCPTQGEATLPEEQDQNYVCKHTYVDRGWCNGCGLFGKGSVVTC seq) AKFQCLEPIEGKVVQHENLKYTVIITVHTGDQHQVGNDTQGVTVEITPQASTVEAILP EYGTLGLECSPRTGLDFNEMILLTMKNKAWMVHRQWFFDLPLPWTSGATTETPTWNHK ERMVTFKVPHAKRQDVTVLGSQEGAMHTALTGATEVDSGDGNHMFAGHLKCRLKMDKL QLKGMSYSMCTGKFQIVKEIAETQHGTIVIRVQYEGDGSPCKIPLETIDLEKRHVLGR LITANPTVTEKDSPVNIEAEPPFGDSYTIIGVEPGQLKLHWFRKCSSIGQMFESTYRG AKRIVIAILGETAWDFGSVGGLLTSLGKAVHQVFGAIYGA 9. 11B8-NPRM MRCVGVGNRDFVEGLSGATWVDVVLEHGSCVTTMAKNKPTLDIELLKTEVTNPATLRK (no prM YCTEAKLTNTTTESRCPTQGEPYLKEEQDQNYVCKHTYVDRGWGNGCGLFGKCSVVTC seq) AKFQCLEPIEGKVVQHENLKYTVIITVHTGDQHQVGNDTQGVTVEITPQASTVEATLP EYGTLGLECSPRTGLDFNRVVLLTMKKKTWLVHKQWFLDLPLPWTAGADTETPTWNRK ELLVTFKNAHAKKQEVVVLGSQEGAMHSALAGATEIQMSSGNHMFAGHLKCKVRMEKL QLKGMSYVMCTGKFQIVKEIAETQHGTIVIRVQYEGDGAPCKIPFSTEDGQGKAHNGR LITANPIVIDKEKPVNIELEPPFGDSYIVVGAGDKALKINWYKKGSSIGQMFEATARG ARRMAILGDTAWDFGSVGGLLTSLGKMVHQIFGSVYTTMFG 10. 11C4-NPRM MRCVGIGNRDFVEGLSGATWVDVVLEHGSCVTTMAKNKPTLDIELLKTEVTNPATLRT (no prM YCIEASTSNTTTESRCPTQGEPYLKEEQDQNYVCKHTYVDRGWGNGCGLFGKGSVVTC seq) AKFQCLEPIEGKVVQHENLKYTIVTTPHSGEEHAVGNDTGKHGKEIKITPQSSITEAE LTGYGTVTMECSPRTGLDFNEMILLTMKSKTWLVHKQWFLDLPLPWTAGADTSQETWN RKELLVTFKNAHAKKQEVVVLCSQEGAIHTALTGATEVDSGDGNLLFTGHLKCRLKMD KLQLKGMSYSMCTGKFQIVKEIAETQHGTIVIRVQYECDGSPCKIPLEIMDLEKRHVL GRLITVNPIVTEKDSPVNIEAEPPFGDSYIIIGVGDKALKLHWFKKGSSTGKMFEATA RGAKRMAILGETAWDFGSAYT 11. 11C11- MRCVGVGNRDFVEGLSGATWVDVVLEHGSCVTTMAKNKPTLDIELLKTEVTNPAVLRK NPRM LCIEASISNITTESRCPTQGEPYLKEEQDQNYVCKHTYVDRGWGNGCGLFGKGSLITC (no prM AKFKCVTKLEGKIVQYENLKYSVIVTVHTGDQHQVGNETQGVTVEITPQASTVEAILP seq) EYGTLGLECSPRTGLDFNEMILLTMKKKTWLVHKQWFLDLPLPWTAGAURSQETWNRK ELLVTFKNAHAKKQEVVVLGSQEGAMHTALTGATEVDSGDGNLLFTGRLKCRLKMDKL QLKGMSYSMCTCFQI\TKEIAETQHGTIVIRVQYEGEDAPCKIPFSTEDGQGKAHNCR LITANPIVTEKDSPVNIEAEPPFGDSYIVVGAGEKALKLHWFKKGSSIGQMFEATARG ARRMAILGDTAWDFGSVGGVFTSIGKALHQVFGATYGA 12. 11E2-NPRM MRCVGVGNRDFVEGLSGATWVDVVLEHGSCVTTMAKNKPTLDIELLKTEVTNPAVLRK (no pRM LCIEASTSNITTATRCPTQGEAILPEEQDQNYVCKHTYVDRGWGNGCGLFGKGSLVTC seq) AKFKCVTKLEGKIVQYENLKYSVIVTVHTGDQHQVGNETTEHGTIATITPQAPTSEIQ LTDYGALTLDCSPRTGLDFNRVVLLTMKKKTWLVIKQWFLDLPLPWTAGASTSQETWN RKELLVTFKNAHAKRQDVTXLGSQEGAMHSALTGATEIQMSSGNLLFAGHLKCRLKMD KLQLKGMSYSMCTGKFQIVKEIAETQHGTIVIRVQYECDGAPCKIPLEIMDLEKRHVL GRLITVNPIVTEKDSPVNIEAEPPFGESYIVVGVGDSALTLHWFRKGSSIGQMFEATA RGAKRMAILGDTAWDFGSVGGVFTSTGKALHQVFGAIYGA 13. 12E3-NPRM MRCVGVGNRDFVEGLSGATWJDVVLEHGSCVTTMAKNKPTLDIELIKTTAKEVALLRT (no pRM YCIEASISNITTATRCPTQGEAILPEEQDQNYVCKHTYVDRGWGNGCGLFGKGSLVTc seq) AKFTCKKNMEGNIVQPENLEYTIVITPHSGEEHAVGNDTGKHGKEIKITPQSSITEAE LTGYGTVTMECSPRTGLDFNRVVLLTMKKKSWLVHKQWFLDLPLPWTAGASTSQETWN RKELLVTFKNAHAKKQEVVVLGSQEGANIHTALTGTEIQMSSGNLLFTGHLKCRLRMD KLQLKGMSYSMCTGKFQIVKEIAETQHGTIVIRVQYEGDGAPCKIPFSTEDGQGKAHN GRLITANPIVIDKEKPVNIELEPPFGESYIVVGAGDKALKINWYKKGSSIGKMFESTY RGAKRMAILCETAWDFCSVGGLLTSLGKAVHQVFGSVYTTMFG 14. 12H4-NPRM MRCIGISNRDFVEGVSGATWVDVVLEHGSCVTTMAKNKPTLDIELIKTEATQPATLRK (no pRM YCIEASISNTTTDSRCPTQGEPYLKEEQDQQYICRRDVVDRGWGNGCGLFGKGSLITC seq) AKFKCVTKLEGKIVQYENLKYSVIVTVHTGDQHQVGNDTQGVTVEITPQAsTVEATLP EYCTLGLECSPRTGLDFNEMILLTNKKKSWLVHKQWFLDLPLPWTAGADTETPTWNRK ELLVFKNAHAKKQEVVVLGSQEGAMHTALTGATEIQMSSGNLLFTGHLKCRLKLMDKL QLKGMSYSMCTGKFQIVKETAETQHGTIVTRVQYECDGAPCKIPFSTEDEKGVTQNGR LITANPIVTEKDSPVNIEAEPPFGDSYTITGVEPGQLKLHWFKKGSSIGQMFEATARG ARRMAILGDTAWDFGSAYT 15. 13E2-NPRM MRCVGIGNRDFVEGVSGATWVDVVLEHGGCVTTMAKNKPTLDFELIKTTAKEVALLRT (no pRM YCTEASTSNITTATRCPTQGEPYLKEEQDQQYICRRDVVDRGWGNGCGLFGKGSLITC seq) AKFKCVTKLEGNIVQPENLEYTIVITPHSGEEIAVGNDTGKHGKEIKITPQSSITEAE LTGYGTVTMECSPRTGLDFNEMILLTMKKKSWLVHKQWFLDLPLPWTAGASTSQETWN RKELLVTFKNAHAKKQEVVVLGSQEGANHTALTGATEIQMSSGNLLFTGHLKCRLRMD KLQLKGMSYTMCTGKFQIVKEIAETQHGTIVTRVQYEGTDAPCKIPFSTEDCQGKAHN GRLITANPIVTDKEKPVNIELEPPFGDSYITTGVEPCQLKLHWFKKGSSIGKMFEATA RGAKRMAILGETAWDFGSVGGLLTSLGKMVHQIFGSVYTTMFG 16. 13E11- MRCVGVGNRDFVEGVSGGAWVDVVLEHGGCVTTMAKNKPTLDIELIKTEATQPATLRK NPRI4 YCIEASISNITTDTRCPTQGEPYLKEEQDQNYVCKHTYVDRGWGNGCGLFGKGSLITC (no pRM AKFKCVTKLEGKIVQYENLKYSVIVTVHTGDQHQVGNETTEHGTIATITPQAPTSETQ seq) LTDYGTVTMECSPRTGLDFNEMILLTMKSKTWLVHKQWFLDLPLPWTAGADTETPTWN RKELLVTFKNAHAKKQEVVVLGSQEGAMHTALAGATEIQMSSGNLLFTGHLKCRLKMD KLQLKGMSYTMCTGKFQIVKEIAETQHGTIVTRVQYEGTDAPCKIPFSTEDGQGKAHN GRLITANPTVIDKEKPVNIELEPPFGESYIVVGAGEKALTLHWFRKGSSIGQMFETTM RGAKPIVIAILGETAWDFGSVGGVFTSIGKALHQVFGATYGA 17. 13F11- MRCVGIGNRDFVEGVSGGAWVDLVLEHGGCVTTMAKNKPTLDIELIKTEATQPATLRK NPRM YCIEAKITNITTDSRCPTQGEPYLKEEQDQQYTCRRDVVDRGWGNGCGLFGKGSLITC (no pRM AKFKCVTKLEGKIVQYENLKYSVIVTVITGDQHQVGNETTEHGTIATITPQAPTSEIQ seq) LTDYGALTLDCSPRTGLDFNRVVLLTMKKKTWLVHKQWFLDLPLPWTAGADTSEVHWN HKERIVTFKVPHAKRQDVTVLGSQEGAMHSALAGATEVDSGDGNINFAGHLKCRLKMD KLQLKGMSYSMCTGKFQIVKEIAETQHGTIVIRVQYEGDGAPCKIPFSTEDEKGKAHN GRLITANPIVIDKEKPVNIELEPPFGESYIVVGAGEKALKLSWFKKGSSIGKMFESTY RGAKRMAILGDTAWDFGSVGGVETSIGKALHQVFGATYGA 18. 14B1-NPRM MRCVGISNRDFVEGVSGGAWVDLVLEHGGCVTTMAKNKPTLDIELLKTEVTNPATLRK (no pRM YCIEAKTTNITTDSRCPTQGEPYLKEEQDQQYICRRDVVDRGWGNGCGLFGKGSLTTC seq) AKFKCVTKLEGNIVQPENLKYTTVITPHTGDQHQVGNDTQGVTVEITPQASTVEATLP EYGTLGLECSPRTGLDFNEMILLTMKKKSWLVHKQWFLDLPLPWTAGADTETPTWNRK ELLVTFKNAHAKRQDVTVLGSQEGAMHSALTGATEVDSGDGNLLFTGHLKCRLKMDKL QLKGMSYTMCTGKFQIVKEIAETQHGTIVIRVQYEGDGAPCKIPFSTEDEKGVTQNGR LITANPTVTEKDSPVNIEAEPPFGESYIVVGAGEKALKLSWFKKGSSIGQMFESTARG ARRMAILGDTAWDFGSAYT 19. 14E9-NPPM MRCVGTGNRDFVECVSGGAWVDLVLEHGGCVTTMAKNKPTLDIELIKTEATQPATLRK (no pRM YCIEAKITNITTDSRCPTQGEAILPEEQDQQYICRRDVTDRGWGNGCGLFGKGSVVTC seq) AKFQCVTKLEGNIVQPENLEYTIVITPHTGDQHQVGNDTQGVTVETTPQASTVEAILP EYGTLGLECSPRTGLDFNEMILMKMKNKAWMVHKQWFLDLPLPWTSGASTSQETWNRK ELLVTFKNAHAKKQEVVVLGSQEGAJAHSALAGATEIQMSSGNLLFTGHLKCRLKIKL QLKGMSYTMCTCKFQTVKEIAETQHGTTVIRVQYEGDGSPCKIPLEIMDLEKRHVLGR LITANPIVIDKEKPVNIELEPPFGDSYIVVGAGDKALKINWYKIGSSIGKMFESTYRG AKRMAILGETAWDFGSAYT 20. 14G10- MRCTOISNPDFVEGVSGATWVDVVLEHGGCVTTMAKNKPTLDIELLKTEVTNPATLRJ NPRM YCIEAKITNITTDTRCPTQGEATLPEEQDQQYICRRDVVDRGWGNGCGLFGKGSLVTC (no pRM AKFTCKKNMECNIVQPENLEYTIVITPHSGEEHAVGNETTEHGTIATITPQAPTSEIQ seq) LTDYGALTLDCSPRTGLDFNRVVLLTMKKKTWLVHKQWFLDLPLPWTAGADTSEVHWN KERIAVTFKVPHAKRQDVTVLGSQEGAMHSALAGATEIQMSSGNLLFTGHLKCRLKMD KLQLKGMSYSMCTGKFQIVKEIAETQHGTIVIRVQYEGDGAPCKTPFSTEDGQGKAHN GRLITANPTVIDKEKPVNTELEPPFGDSYTVVGAGDKALKINWYKKGSSIGKMFESTY RGAKRMAILGETAWDFGSVGGVFTSIGKALHQVFGAIYGA 21. 14H2-NPRM MRCVGVGNRDFVEGVSGGAWVDLVLEHGGCVTTMAKNKPTLDFELIKTTAKEVALLRT (no pRM YCIEAKITNITTATRCPTQGEPYLKEEQDQNYVCKHTYVDRGWGNGCGLFGKGSLITC seq) AKFQCVTKLEGNIVQPENLEYTIVITPHSGEEHAVGNDTQGVTVEITPQASTVEAILT DYGALTLDCSPRTGLDFNRVVLLTMKKKTWLVRKQWFLDLPLPWTAGADTSEVIWNHK ERIMVTFKVPHAKKQEVVLGSQEGAMHTALTGATEVDSGDGNLLFTGHLKCRLKMDKL QLKGMSYSMCTGKFQIVKETAETQHGTTVIRVQYKGEDAPCKIPFSSQDGQGKAHNGR LITANPIVTEKDSPVNIEAEPPFGESYIVVGAGEKALKLSWFKKGSSIGQMFETTMRG AKRMAILGETAWDFGSVGGVFTSIGKALHQVFGAIYGA 22. 15C2-NPRM MRCVGISNRDFVEGLSGATWVDVVLEHGSCVTTMAKNKPTLDIELLKTEVTNPATLRK (no pRM YCIEAKITNITTDSRCPTQGEPYLKEEQDQQYICRRDVVDRGWGNGCGLFGKGSVVTC seq) AKFQCLEPIEGKIVQYENLKYSVIVTVITGDQHQVGNDTQGVTVEITPQASTVEAILP EYGTLGLECSPRTGLDFNEMTLLTMKNKAWMVHRQWFFDLPLPWTAGADTSEVHWNHK ERMVTFKVPHAKRQDVTVLGSQEGAMHSALAGATEVDSGDGNHMFAGHLKCRLKMDKL QLKCMSYVMCTGKFQTVKEIAETQHCTIVIRVQYEGDGAPCKIPFSTEDEKGVTQNGR LITANPTVTDKEKPVNIEAEPPFGESYIVIGVEPGQLKLHWFKKGSSIGQMFEATARG ARRMATLGETAWDFGSVGGLLTSLGKAVHQVFGSVYTTMFG 23. 15D4-NPRM MRCVGIGNRDFVEGLSGATWVDVVLEHGGCVTTMAKNKPTLDFELIKTTAKEVALLRT YCIEAKTTNITTDSRCPTQGEATLPEEQDQNYVCKHTYVDRGWGNGCGLFGKGSLITC AKFKCLEPTEGKVVQPENLEYTIVITPHSGEEHAVGNDTGKHGKEIKITPQSSITEAE LTGYGTVTMECSPRTGLDFNRVVLLTMKKKSWLVHKQWFLDLPLPWTAGADTSEVHWN HKEPVTFKTAHAKKQDVVVLGSQEGA4IISALAGATETQMSSGNLLFTGHLKCRLRMD KLTLKGVSYVMCTGSFKLEKEVAETQHGTVLVQVKYEGTDAPCKIPFSTEDEKGVTQN GRLTTANPIVTEKDSPVNIEAEPPFGESYIVVGAGDKALKINWYKKGSSIGKMFESTY RGAKRMAILGDTAWDFGSLGGVFTSTGKALHQVFGAIYGA 24. 15H4-NPRM MRCVGVGNRDFVEGLSGATWVDVVLEHGSCVTTMAKNKPTLDFELIKTEVTNPATLRK YCIEASISNITTATRCPTQGEANLPEEQDQNYVCKHTYVDRGWGNGCGLFGKGSLVTC AKFQCLEPTEGKVVQHENLKYTVIITVHTGDQHQVGNDTQGVTVEITPQASTVEAILP EYGTLGLECSPRTGLDFNRVVLLTMKKKSWLVHKQWFLDLPLPWTAGADTSEVHWNHK ERMVTFKTAHAKKQEVVVLGSQEGAMHSALAGATEIQMSSGNLLFTGRLKCRLKMDKL QLKGMSYTMCTGKFQIVKEIAETQHGTIVTRVQYEGDGAPCKIPLEIMDLEKRHVLGR LTTVNPIVIDKEKPVNIEAEPPFGDSYIIIGVEPGQLKLIWFKKGSSIGQMFESTYRG AKRMATLGDTAWDFGSVGGVLNSLGKMVHQIFGSVYTTMFG 25. 16B4-NPRM MRCVGVGNPDFVEGLSGATWVDVVLEHGSCVTTMAKNKPTLDTELLKTEVTNPAVLRK LCIEASISNITTDSRCPTQGEAILPEEQDQNYVCKHTYVDRGWGNGCGLFGKGSLITC AKFKCLEPIEGKVVQHENLKYTVIITVHTGDQHQVGNDTQGVTVEITPQASTVEAILP EYGTLGLECSPRTGLDFNEMILLTMKKKTWLVHKQWFLDLPLPWTAGADTSEVHWNHK ERI4VTFKTAHAKKQDVVVLGSQEGAMHTALTGATIQMSSGTTTIFAGHLKCRLKMDK LQLKGMSYTMCTGKFQITKEIAETQHGTILIKVEYKGEDAPCKTPFSTEDEKGVTQNG RLITANPTVTEKDSPVNIEAEPPFGESYIVVGVGDKALKINWYKKGSSIGQMFETTMR GAKRMATLGDTAWDFGSVGGLLTSLGKMVHQIFGSVYTTMFG 26. 16E8-NPPM MRCVGVGNRDFVEGLSGATWVDVVLEHGGCVTTMAKNKPTLDFELIKTTAKEVALLRT YCIEASISNTTTATRCPTQGEPYLKEEQDQQYICRRDVVDRGWGNGCGLFGKGSLITC AKFKCVTKLEGNTVQPENLEYTIVITPHTGDQHQVGNDTQGVTVETTPQASTVEAILP EYCTLGLECSPRTGLDFNEMILLTNKNKAWMVHKQWFLDLPLPWTSGATTETPTWNRK ELLVTFKTAHAKKQDVVVLGSQEGAMHTALTGATEIQMSSGNLLFTGHLKCRLRMDKL QLKGMSYVMCTGKFQIVKEIAETQHGTTLIKVEYKGEDAPCKIPFSTEDGQGKAHNGR LITANPIVIDKEKPVNIEAEPPFGESNIVIGIGDKALKINWYKKGSSIGKMFETTMRG AKRMAILGDTAWDFGSVGGVFTSIGKALHQVFGAIYGA 27. 16E10- MRCIGISNRDFVEGVSGATWVDVVLEHGSCVTTMAKNKPTLDIELLKTEVTNPATLRK NPRM YCIEAKISNTTTDSRCPTQGEATLVEEQDQNYVCKHTYVDRGWGNCCGLFGKGSLITC (no pRM AKFKCVTKLEGKIVQYENLKYSVITTVHTGDQHQVGNDTQGVTVEITPQASTVEAILP seq) EYGALGLECSPRTGLDFNEMILLTNKNKAWMVHGQWFFDLPLPWTSGATTETPTWNRK ELLVTFKNAHAKKQEVVVLGSQEGAMHSALAGATEIQMSSGNLLFTGHLKCRLRMDKL QLKGMSYSMCTGKFQIVKEIAETQHGTIVIRVQYEGDGSPCKIPLEIMDLEKRHVLGR LITVNPIVIDKEKPVNIEAEPPFGESYIVVGVGDSALTLHWFRKGSSIGQMFETTMRG AKRMAILGDTAWDFGSVGGVLNSLGKMVIIQIFGSVYTTMFG 28. 16F12- MRCVGVGNGDFVEGLSGATWVDVVLEHGSCVTTMAKNKPTLDFELIKTIAKEVALLRT NPRM YCTEASISNTTTDSRCPTQGEAILPEEQDQNYVCKHTYVDRGWGNGCGLFGKGSVVTC AKFQCLEPIEGKVVQPENLEYTIVTTPHTGDQHQVGNDTQGVTVEITPQASTVEAILP EYGTLGLECSPRTGLDFNEMILLTMKNKAWMVHRQWFFDLPLPWTSGATTEVHWNHKE RMVTFKTAHAKKQDVVVLGSQEGTMHTALTGATETQMSSGTTTIFAGHLKCKVKMDKL QLKGMSYTMCTGKFQIVKEIAETQHGTIVIRVQYEGDGAPCKIPLETMDLEKRHVLGR LITVNPIVIDKEKPVNIEAEPPFGESYIVVGVGDSALKINWYKKGSSIGKMFESTYRG AKRMATLGETAWDFGSVGGVFTSTGKALHQVFGAIYGA 29. 16G11- MRCVGTGNRDFVEGLSGATWVDVVLEHGGCVTTMAKNKPTLDTELLKTEVTNPAVLRK NPRM LCTEASTSNTTTDSRCPTQGEATLVEEQDTNFVCRRTFVDRGWGNGCGLFGKGSVVTC AKFQCLEPIEGKVVQHENLKYTVTTTVHTGDQHQVGNDTQGVTVEITPQASTVEAILP EYGTLGLECSPRTGLDFNEMILLTMKNKAWLVHKQWFLDLPLPWTAGADTSEVHWNHK ERMVTFKTAHAKKQDVVVLGSQECAMHTALTGATEIQMSSGNLLFTGHLKCRLRMDKL QLKGMSYVMCTGKFQIVKEIAETQHGTIVIRVQYEGDGAPCKTPFSTEDGQGKAHNGR LITANPTVTDKEKPVNIELEPPFGDSYIVVGAGDKALKINWYKKGSSIGKMFEATARG ARRMAILGDTAWDFGSLGGLLTSLGKAVHQVFGSVYTTMFG 30. 17A2- MRCVGVGNRDFVEGLSGATWVDVVLEHGSCVTTMAKNKPTLDFELIKTEATQPATLRK NPRM YCIEAKITNITTDSRCPTQGEATLPEEQDQNYVCKHTYVDRGWGNCCGLFGKGSLITC (no pRM AKFKCLEPIEGKVVQPENLEYTIVTTPHTCDQHQVGNDTQGVTVEITPQASTVEAILP seq) EYGTLGLECSPRTGLDFNEMILLTMKKKSWLVHKQWFLDLPLPWTSGATTETPTWNRK ELLVTFKNAHAKRQDVTVLGSQEGAIHSALTGATEIQTSGTTTTFAGHLKCRVRMDKL QLKGMSYVMCTGSFKLEKEVAETQHGTVLVQVKYEGTDAPCKIPVSSQDEKGVTQNGR LITANPIVTEKDSPVNIEAEPPFGDSYTIIGVEPGQLKLHWFKKGSSIGQMFETTMRG AKRMATLGDTAWDLGSVGGVPTSIGKALHQVFGAIYGA 31. 17D5-NPRM MRCIGISNRDFVEGVSGATWVDVVLEHGGCVTTMARNKPTLDIELTKTEATQPATLRK (no pRM YCIEASISNITTATRCPTQGEAILPEEQDQQYICRRDVVDRGWGNGCGLFGKGSVVTC seq) AKFQCLEPTEGKVVQHENLKYTVITTVHTGDQHQVGNDTQGVTVEITPQAPTSEIQLT DYGALTLDCSPRTGLDFNRVVLLTMKKKTWLVHKQWFLDLPLPWTSGATTETPTWNRK ELLVTFKNAHAKKQEVVVLGSQEGAMHTALTGATEVDSGDGNLLFTGHLKCRLKMDKL QLKGMSYTMCTGKFQIVKEIAETQHGTILIKVEYKGEDAPCKIPFSTEDEKGVTQNGR LITANPIVTEKDSPVNIEAEPPFGESYIVVGAGDKALKINWYKKGSSIGKMFESTYRG AKPMAILGDTAWDFGSLGGVFTSIGKALHQVFGAIYGA 32. 17D11- MRCVGTGNRDFVEGLSGATWVDVVLEHGGCVTTMAQGKPTLDIELIKTEATQPATLRK NPRM YCIEAKLTNTTTESRCPTQGEPYLKEEQDQNYVCKHTYVDRGWGNGCGLFGKGSVVTC (no pRM AKFQCLEPIEGKVVQHENLKYTVHTVHTCDQHQVGNDTQGVTXTEITPQASTVEAILP seq) EYGTLGLECSPRTGLDFNEMILLTMKKKSWLVRKQWFLDLPLPWTAGASTSQETWNRK ELLVTFKNAHAKKQEVVVLGSQEGAMHTALTGATEVDSGDGNLLFTGHLKCRLKMDKL QLKGMSYTMCTGKFQIVKEIAETQHGTILIKVEYKGTDAPCKIPFSSQDEKGVTQNGR LITANPIVTEKDSPVNIEAEPPFGESYIVVGAGDKALKINWYKKGSSIGKMFESTYRG AKRMAILGETAWDFGSVGGVFTSIGKALHQVFGAIYGA 33. 17F5-NPRM MRCVGVGNRDFVEGLSGATWVDVVLEHGSCVTTMAKNKPTLDIELLKTEVTNPATLRK (no pRM YCIEAKITNITTDSRCPTQGEAILPEEQDQNYVCKHTYVDRGWGNGCGLFGKGSVVTC seq) AKFQCLEPIEGKVVQHENLKYTVIITVHTGDQHQVGNDTQGVTVEITPQASTVEAILP EYGTLGLECSPRTGLDFNRVVLLTMKKKSWLVHKQWFLDLPLPWTAGADTSEVIWNHK ERMVTFKTAHAKKQDVVVLGSQEGAMHSALAGATETQMSSGNLLFTGHLKCRLRMDKL QLKGMSYVMCTGKFQIVKEIAETQHGTIVIRVQYEGDDAPCKIPFSTEDEKGVTQNGR LITANPIVTEKDSPVNIEAEPPFGESYIVVGVGDSALTLHWFRKGSSIGQMFETTMRG AKRMAILGDTAWDFGSLGGVFTSLGKAXTHQIFGSVYTTMFG 34. 17F11- MRCIGISNRDFVEGVSCATWVDVVLEHGGCVTTMAKNKPTLDFELIKTTAKEVALLRT NPRM YCIEASISNITTATRCPTQGEAILPEEQDQNYVCKHTYVDRGWCNGCGLFGKGSLITC (no pRM AKFQCLEPIEGKVVQHENLKYTVIITVHTGDQHQVGNDTQGVTVEITPQASTVEAILP seq) EYGTLGLECSPRTGLDFNEMILLTMKNKAWMVHKQWLLDLPLPWTAGADTSEVHWNHK ERMVTFKVPHAKKQDVVVLGSQEGAMHTALTGATEIQMSSGNLLFTGHLKCRLRMDKL QLKGMSYSMCTGKFQIVKETAETQHGTILTKVEYKGEDAPCKTPFSTEDEKGVTHNGR LITANPIVTDKEKPVNIELEPPFGDSYIVIGVGDSALTLHWFRKGSSIGKMFESTYRG AKRMAILGETAWDFGSVGGLLTSLGKAVHQVFGSVYTTMFG 35. 17G5-NPRM MRCVGIGNRDFVEGLSGATPVDVVLEHGGCVTTMAKNKPTLDFELIKTTAKEVALLRT (no pRM YCIEASISNITTATRCPTQGEATLKEEQDQNYVCKHTYVDRGWGNGCGLFGKGSVVTC seq) AKFQCLEPIEGKVVQHENLKYTVIITVHTCDQHQVGNDTQGVTVEITPQASTVEAILP EYGTLCLECSPRTGLDFNEMILLTMKNKAWMVHRQWFFDLPLPWTAGADTSEVHWNHK ERMVTFKTAHAKKQEVVVLGSQEGAMHSALAGATEVDSGDGNHMFAGHLKCKVRMDKL TLKGVSYVMCTGSFKLEKEIAETQHGTILIKVEYKGEDAPCKTPFSTEDGQGKAHNGR LITANPIVTDKEKPVNIELEPPFGDSYIVVGAGDKALKTNWYKKGSSICKMFESTYRG AKRMAILGETAWDFGSVGGVFTSIGKALHQVFGATYGA 36. 17H3-NPRM MRCIGISNRDFVEGVSGATWJDVVLEHGSCVTTMAKNKPTLDTELLKTEVTNPATLRK (no pRM YCTEAKITNITTATRCPTQGEATLPEEQDQNYVCKHTYVDRGWGNGCGLFGKGSLVTC seq) AKFQCLEPTEGKVVQHENLKYTVIITVHTGDQHQVGNDTQGVTVETTPQASTVEATLP EYGTLGLECSPRTCLDFNEMILLTMKKKTWMVHRQWFFDLPLPWTAGADTSEVHWNHK ERMVTFKTAHAKRQDVTVLGSQEGAMHSALTGATEVDSGDGNLLFTGHLKCRLKMDKL QLKGMSYTMCTGKFQTVKEIAETQHGTILIKVEYKGEDAPGKTPFSTEDGQGKAHNGR LITANPAVTDKEKPVNIELEPPFGDSYIVVGAGDKALKLSWFKKGSSTGQMFESTYRG AKRMAILGETAWDLGSIGGVFTSVGK 37. 17H10- MRCIGISNRDFVEGVSGGSWTDVVLEHGSCVTTMAKNKPTLDTELLKTEVTNPATLRK NPRM YCIEAKITNITTDSRCPTQGEATMPEEQDQNYVCKHTYVDRGWGNGCGLFGKGSLVTC (no pRM AKFTCKKNMEGKVVQHENLKYTVIITVITGDQHQVGNDTQGVTVEITPQASTVEATLP seq) EYGTLGLECSPRTGLDFNEMILLTMKNKAWMVHRQWFLDLPLPWTAGADTSEVHWNHK ERMVTFKTAHAKKQDVVVLGSQEGAMHTALTGATEVDSGDGNLLFTGHLKCRLKIDKL QLKGMSYVMCTGKFQIVKEIAETQHGTIVIRVQYEGDGAPCKTPFSTEDGQGKAHNGR LITANPTVIDKEKPVNIELEPPFGDSYIVVGAGDKALKINWYKKGSSICKMFEATARG ARPMAILGDTAWDFGSVGGVLTSLGKMVHQIFGSVYTTMFG 38. 17H2- MRCVGVGNRDFVEGLSGATWVDVVLEHGSCVTTMAQGKPTLDIELLKTEVTNPATLRK NPRM YCIEAKLTNTTTESRCPTQGEPYLKEEQDQQYICRRDVVDRGWGNGGGLFGKGSLITC (no pRM AKFKCVTKLEGKIVQYENLKYSVIVTVHTGDQHQVGNETTEHGTIATITpQAPTSEIQ seq) LTDYGALTLDCSPRTGLDFNRVVLLTMKKKSWLVHKQWFLDLPLPWTAGADTSEVHWN HKERIVTFKTAHAKKQDVVVLCSQEGAMHTALTGATEVDSGDGNLLFTGHLKCRLKMD KLQLKGMSYSMCTGKFQIVKEIAETQHGTIVIKVEYKGEDAPCKIPFSTEDGQGKAHN GRLITANPIVIDKEKPVNTEAEPPFGESYIVVGAGEKALKLSWFKKGSSTCQMFETTM RGAKRMAILGDTAWDFGSVGGLLTSLGKMVHQIFGSVYTTMFG 39. 18A9-NPRM MRCVGIGNRDFVEGLSGATWVDVVLEHGSCVTTMAKNKPTLDIELLKTEVTNPATLRK (no pRM YCIEAKITNTTTATRCPTQCEATLPEEQDQNYVCKHSMVDRGWGNGCGLFGKGSVVTC seq) AKFQCLEPIEGKVVQHENLKYTVIITVHTGDQHQVGNDTQGVTVEITPQASTVEAILP EYGTLGLECSPRTGLDFNEMILLTMKNKAWMVHKQWFLDLPLPWTAGADTSEVHWNHK ERMVTFKVPHAKRQDVTVLGSQEGANISALAGATETQMSSGNLLFTGHLKCRLRMDKL TLKGVSYVMCTGKFQIVKEIAETQHGTIVIRVQYEGTDAPCKIPFSSQDEKGVTQNGR LITANPIVIDKEKPVNIELEPPFGDSYIVVGAGDKALTLHWFRKGSSIGQMFETTMRG AKRMATLGDTAWDFGSVGGLLTSLGKMVHQIFGSVFTSVGK 40. 18B7-NPRM MRCIGISNRDFVEGLSGATWVDVVLEHGSCVTTMAKNKPTLDFELIKTTAKEVALLRT (no pRM YCIEASISNITTATRCPTQGEPYLKEEQDQQYICRRDVVDRGWGNGCGLFGKGSVVTC seq) AKFQCLEPIEGKVVQHENLKYTVIITVHTGDQHQVGNDTQGVTVEITPQASTVEAILP EYGTLGLECSPRTGLDFNEMTLLTMKNKAWMVHKQWFLDLPLPWTAGADTSEVHWNHK ERMVTFKTAKKQDVVVLGSQEGAMHTALTGATEVDSGDGNLLFTGHLKCRLRIVJDKL QLKGMSYSMCTGKFQIVKEIAETQHGTILIKVEYKGEDAPCKIPFSTEDEKGVTQNGR LITANPIVIDKEKPVNIEAEPPFGDSYTVVGAGDKALKINWYKKGSSTGKMFESTYRG AKRMATLGETAWDFGSVGGLLTSLGKAVHQVFGSVYTTMFG 41. 18D7-NPRM MRCIGISNRDFVEGVSGATWVDVVLEHGSCVTTMANKPTLDIELLKTEVTNPAVLRK (no pRM LCIEASISNITTATRCPTQGEAILPEEQDQNYVCKHTYVDRGWGNGCGLFGKGSLITC seq) AKFKCVTKLEGKIVQYENLKYSVIVTVHTGDQHQVGNDTQGVTVETTPQASTVEATLP EYCTLGLECSPRTGLDFNEMILLTMKNKAWMVHRQWFFDLPLPWTAGADTSEVHWNHK ERMVTFKTAHAKKQDVVVLGSQEGAMIITALTGATETQMSSCTTIFAGHLKCRLKMDK LQLKGMSYSMCTGKFQIVKEIAETQHGTILIKTKYEGTDAPCKIPFSSQDEKGVTQNG RLITANPIVTEKDSPVNTEAEPPFGESYI WGAGEKALKLSWFKKGSSIGQMFESTYR GAKRMAILGETAWDFGSVGGLLTSLGKAVIQVFGSVYTTMFG 42. 18E9-NPPM MRCVGVGNRDFVEGLSGATWVDVVLEHGGCVTTMAKNKPTLDFELIKTEATQPATLRK (no pRM YCIEAKLTNTTTATRCPTQGEPYLKEEQDQNYVCKIITYVDRGWGNGCGLFGKGVVTC seq) AKFQCLEPIEGKVVQHENLKYSVIVTVHTCDQHQVGNDTQGVTVEITPQASTVEAILP EYGTLGLECSPRTGLDFNEMILLTMKKKSWLVHKQWFLDLPLPWTSGASTSQETWNRK ELLVTFKNAHAKRQDVTVLGSQEGAMHTALTGATEVDSGDGNLLFTGHLKCRLKMDKL QLKGMSYTMCTGKFQIVKETAETQHGTILIKVEYKGEDAPCKIPFSSQDEKGVTQNGR LITANPIVIDKEKPVNIELEPPFGDSYIVVGAGDKALKINWYKKGSSIGKMFESTYRG AKRMAILGDTAWIJFGSVGGVLNSLGKMVHQIFGSVYTTMFG 43. 18E10- MRCVGVGNRDFVEGLSGATWVDVVLEHGGCVTTMAKNKPTLDIELQKTEATQLATLRK NPRM YCTEAKLTNTTTESRCPTQGEAILPEEQDQNYVCKHTYVDRGWGNGCGLFGKGSLITC (no pRM AKFKCVTKLEGKIVQYENLKYSVIVTVHTGDQHQVGNETTEHGTIATTTPQAPTSEIQ seq) LTDYGALTLDCSPRTGLDFNRVVLLTMKKKSWLVHKQWFLDLPLPWTAGADTSEVHWN HKERMVTFKTAHAKKQEVVVLGSQEGAMHTALTGATEVDSGDGNLLFTGHLKCRLRMD KLQLKGMSYSMCTGKFQIVKEIAETQHGTIVIRVQYEGDGSPCKIPLEIMDLEKRHVL GRLITANPIVTDKEKPVNIELEPPPGDSYIVVGAGDKALKINWYKKGSSIGQMFETTM RGAKRMAILGDTAWDFGSVGGLLTSLGKAVHQVFGSVYTTMFG 44. 18E11- MRCVGVGNRDFVEGLSGATWVDVVLEHGGCVTTMAKNKPTLDFELIKTTVKEVALLRT NPRM YCIEASISNITTATRCPTQGEPYLKEEQDQQYICRRDVVDRGWGNGCGLFGKGSLITC (no pRM AKFKCVTKLEGNIVQPENLEYTIVITPHTGDQHQVGNDTQGVTVEITPQASTVEATLP seq) EYGTLGLECSPRTGLDFNEMILLTMKKKSWLVHKQWFLDLPLPWTAGADTSEVHWNHK ERMVTFKTAHAKKQDVVVLGSQEGANHTALTGATEVDSGDGNLLFTGHLKCRLKMDKL QLKGMSYSMCTGKFQIVKEIAETQHGTILIKVEYKGEDAPCKIPFSTEDEKGKAHNGR LTTANPTVTDKEKPVNIEAEPPFGESYIVVGAGDKALKLSWFKKGSSIGQMFESTYRG AKRMAILGETAWDFGSVGGVFTSIGKALHQVFGAIYGA 45. 18H2-NPRM MRCVGVGNRDFVEGVSGGAWVDLVLEHGSCVTTMAKNKPTLDIELLKTEVTNPATLRK (no pRM YCIEAKITNITTDSRCPTQGEAILPEEQDQNYVCKHTYVDRGWGNGCGLFCKGSVVTC seq) AKFQCLEPIEGKWQHENLVKYTVIITVITGDQHQVGNDTSNHGVTVETTPQASTVEAI LPEYGTLGLECSPRTGLDFNRVVLLTMKKKSWLVHKQWFLDLPLPWTAGADTSEVHWN HKERMVTFKVPHAKKQDVVVLGSQEGAMHTALAGATEIQMSSGNLLFTGHLKCRLKMD KLQLKGMSYSMCTGKFQIVTKEIAETQHGTIVMVQYEGDGAPCKIPFSTEDGQGKAHN GRLTTANPVVTKKEEPVNIEAEPPFGESNIVIGIGDKALKINWYKKGSSICKMFETTM RGAKRMAILGDTAWDFGSVGGLLTSLGKMVHQIFGSVYTTNFG 46. 18H6-NPPM MRCVGVGNRDFVEGLSGATWMVVLEHGGGVTTMAYIJKPTLDFELIKTTAKEVALLRT (no pRM YCIEASISNITTATRCPTQGEAILPEEQDQNYVCKHTYVDRGWGNGCGLFGKGSLVTC seq) AKFTCKKNMEGNIVQPENLEYTIVITPHTGDQHQVGNDTQGVTVEITPQASTVEAILP EYGTLGLECSPRTGLDFNRVVLLTMKKKSWLVHKQWFLDLPLPWTAGADTSEVHWNHK ERMVTFKNAHAKRQDVTVLGSQEGANHSALAGATETQMSSGNLLFTGHLKCRLKNDKL QLKGVSYVMCTGSFKLEKEVAETQHGTVLVQVKYEGTDAPCKIPFSTEDCQGKAINGR LITVNPIVIDKEKPVNIELEPPFGESYTVVGAGEKALKLSWFKKGSSIGKMFEATARG AKRMAILGETAWDFGSVGGLLTSLGKAVHQVFGAIYGA 47. 21C1 MRCIGISNRDFVEGVSGGAWVDLVLEHGGCVTTMAQGKPTLDFELIKTTAKEVALLRT ETRTUNC YCIEASISNITTATRCPTQGEPYLKEEQDQQYICRRDVVDRGWGNGCGLFGKGGVVTC only (no AKFKCVTKLEGKIVQYENLEYTVVVTVHNGDTHAVGNDTSNIGVTATITPRSPSVEVK pRM) LPDYGELTLDCEPRSGIDFNEMILMKMKKKTWLVHKQWFLDLPLPWTAGADTSEVHWN HKERTFKVPHAKRQDVTVLGSQEGANHSATLAGATEVDSGDGNIIMFAGHLKCKVRMD KLQLKGMSYSMCTGKFQLTKEIAETQHGTIVMTVQYEGDGSPCKIPLEIMDLEKRHVL GRLITVNPIVIDKEKPVNTEAEPPFGDSYIIIGVGDSALTLHWFRKGSSIGKNFESTY RGARRMAILGETAWDFGSVGGLLTSLGKAVHQVFGSVYTTMFG 48. 23C12 MRCVGVGNRDFVEGLSGATWVDVVLEHGGCVTTMAKNKPTLDIELQKTEATQLATLRK ETRUN LCIEGKITNITTDSRCPTQGEAILPEEQDQNYVCKHTYVDRGWGNGCGLFGKGSLVTC only (no AKFQCLEPIEGKVVQHENLKYTVTITVHTGDQHQVGNDTQGVTVEITPQASTVEAILP pRM) C EYGTLGLECSPRTGLDFNEMTLLTMKNKAWNVHRQWFFDLPLPWTSGATTETPTWNRK ELLVTFKNAHAKKQEVVVLGSQEGAMHTALTGTTEIQNSGGTSIFAGHLKCRLKMDKL ELKGMSYSMCTGKFQIVKEIAETQHGTIVIRVQYEGDGSPCKIPLEIMDLEKRHVLGR LITVNPIVTEKDSPVNIEAEPPFGDSYIIIGVEPGQLKLHWFRKGSSIGQMFETTMRG AKRMAILGETAWDFGSVGGLLTSLGKAVHQVFGSVYTTMFG 49. 23F5 MRCVGIGNRDFVEGLSGGAWVDLVLEHGGCVTTMAKNKPTLDFELTKTEATQPATLRK ETRUNC YGIEAKLTNTTTESRCPTQGEAILPEEQDQNYVCKHTYVDRGWGNGCGLFGKGSLVTC only (no AKFQCLEPIEGKVVQHENLKYTVIITVHTGDQHQVGNDTQGVTVETTPQASTVEAILP pRM) EYGTLGLECSPRTGLDFNEMILLTMKNKTWLVHKQWFLDLPLPWTAGADTREVHWNHK ERVTFKVPHAKRQDVTVLGSQEGAHSALAGATEVDSGDGNIIMIFAGHLKCRLKMDKL ELKGMSYSMCTGKFRIVKEIAETQHGTIVIRVQYEGDGSPCKIPLEIMDLEKRHVLGR LITVNPTVTDKEKPVNIEAEPPFGESYIVVGAGEKALKLSWFKKGSSIGKMFEATARG ARRMAILGDTAWDFGSIGGVFTSVGK 50. SEQUENCE M X₁ X₂ X₃ F I L X₄ M L V X₅ P S X₅ P S X₆ X₇ M R C V G X₈ G N R D F PATTERN V E G X₉ S G X₁₀ X₁₁ W V D X₁₂ V L E H G X₁₃ C V T T M A K N K P T L D X₁₄ E L X₁₅ K T X₁₆ X₁₇ X₁₈ X₁₉ X₂₀ L R X₂₂ X₂₃ C I E A X₂₄ I X₂₅ N X₂₆ T T X₂₇ X₂₈ R C P T Q G E X₂₉ X₃₀ L X₃₁ E E Q D X₃₂ X₃₃ X₃₄ X₃₅ C X₃₆ X₃₇ X₃₈ X₃₉ V D R C W G N G C C L F C K G S X₄₀ X₄₁ T C A K F X₄₂ C X₄₃ X₄₄ X₄₅ X₄₆ E C X₄₇ X₄₈ V Q X₄₉ E N L X₅₀ Y T X₅₁ X₅₂ T X₅₃ H X₅₄ G X₅₅ X₅₆ H X₅₇ V G N D T X₅₈ X₅₉ X₆₀ G X₆₂ X₆₂ X₆₃ X₆₄ I T P Q X₆₅ S X₆₆ X₆₇ E A X₆₈ L X₆₉ X₇₀ Y G T X₇₁ X₇₂ X₇₃ E C S P R T C L D F N X₇₄ 7 X₇₆ L L T M K X₇₇ K X₇₈ W X₇₉ V H X₈₀ Q W F X₈₁ D L P L P W T X₈₂ G A X₈₃ T X₈₄ X₈₅ X₈₆ X₈₇ W N X₈₈ K E X₈₉ X₉₀ V T F K X₉₁ X₉₂ H A K X₉₃ Q X₉₄ V X₉₅ V L G S Q E G A M H X₉₆ A L X₉₇ G A T F X₉₈ X₉₉ X₁₀₀ X₁₀₁ X₁₀₂ G X₁₀₃ X₁₀₄ X₁₀₅ X₁₀₆ F X₁₀₇ G H L K C X₁₀₈ X₁₀₉ X₁₁₀ M D K L X₁₁₃ L K G X₁₁₂ S Y X₁₁₃ M C T G X₁₁₄ F X₁₁₅ X₁₁₆ X₁₁₇ K E X₁₁₈ AN E T Q H G T X₁₁₉ X₁₂₀ X₁₂₁ X₁₂₂ V X₁₂₃ Y X₁₂₄ G X₁₂₅ X₁₂₆ X₁₂₇ P C K I P X₁₂₈ X₁₂₉ X₁₃₀ X₁₃₁ D X₁₃₂ X₁₃₃ X₁₃₄ X_(X) ₁₃₅ X₁₃₆ X₁₃₇ X₃₈ G R L I T X₁₃₉ N P X₁₄₀ V X₁₄₁ X₁₄₂ K X₁₄₃ X₁₄₄ P V N I E X₁₄₅ E P P F C X₁₄₆ S X₁₄₇ I X₁₄₈ X₁₄₉ C X₁₅₀ X₁₅₁ X_(152 X) ₁₅₃ X₁₅₄ L X₁₅₅ X₁₅₆ X₁₅₇ W X₁₅₈ X₁₅₉ K G S S I G X₁₆₀ M F E X₁₆₁ T X₁₆₂ R G A X₁₆₃ R M AN I L G X₁₆₄ T A W D F G S X₁₆₅ G C X₁₆₆ X₁₆₇ T S X₁₆₈ G K X₁₆₉ X₁₇₀ H Q 7 F C X₁₇₂ X₁₇₃ Y X₁₇₄ X₁₇₅ X₁₇₆ 7, X₁₇₈ wherein X₁ is a V, T, or C residue, 2 is a V or I residue, X₃ is an I or F residue, X₄ is a L or M residue, X₅ is an A or T residue, X₆ is a Y or M residue, X₇ is an A, T, or G residue, X₈ is a V or I residue, X₉ is a L or V residue, X₁₀ is an A or C residue, is an A or T residue, X₁₂ is a L or V residue, X₁₃ is a S or G residue, X₁₄ is an I or F residue, X₁₅ is a L or I residue, X₁₆ is a T or E residue, is an A or V residue, X₁₈ is a K or T residue, X₁₉ is an E or N residue, X₂₀ is a V or P residue, X₂₁ is a L, V, or T residue, X₂₂ is a K or T residue, X₂₃ is a L or Y residue, X₂₄ is a K or S residue, X₂₅ is a T or S residue, X₂₆ is an I or T residue, X₂₇ is an A or D residue, X₂₈ is a T or S residue, X₂₉ is an A or P residue, X₃₀ is an I, T, or Y residue, X₃₁ is a K or V P residue, X₃₂ is a T or Q residue, X₃₃ is a Q or N residue, X₃₄ is a F or Y residue, X_(X) ₃₅ is a V or I residue, X₃₆ is a K or R residue, X₃₇ is a H or R residue, X₃₈ is a T or D residue, X₃₉ is a V or F Y residue, X₄₀ is a L or V residue, X₄₁ is a V or I residue, X₄₂ is a K, T, or Q residue, X₄₃ is a L, V, or K residue, X₄₄ is a K, T, or E residue, X₅ is a K, F, or N residue, X₄₆ is a L, I, or N residue, X₄₇ is a K or N residue, X₄₈ is a V or I residue, X₄₉ is a H or P residue, X₅₀ is a K or E residue, X₅₁ is a V or I residue, X₅₂ is a V or I residue, X₅₃ is a V or P residue, X₅₄ is a T or S residue, X₅₅ is an E or D residue, X₅₆ is a Q or E residue, X₅₇ is an A or Q residue, X₅₈ is a S or G residue or Represents a single residue deletion, X₅₉ is a K or N residue or represents a single residue deletion, X₆₀ is a H or Q residue, X₆₁ is a V or K residue, X₆₂ is a T or E residue, X₃ is a V or I residue, X₆₄ is a K or E residue, X₆₅ is an A or S residue, X₆₆ is a T or I residue, X₆₇ is a V or T residue, X₆₈ is an I or E residue, X₆₉ is a T or P residue, X₇₀ is a G or E residue, X₇₁ is a L or V residue, X₇₂ is a T or G residue, X₇₃ is a L or M residue, X₇₄ is an R or E residue, X₁₅ is a V or M residue, X₇₆ is a V or I residue, X₇₇ is a K or N residue, X₇₈ is an A, T, or S residue, X₇₉ is a L or N residue, X₈₀ is a K or R residue, X₈₁ is a L or F residue, X₈₂ is an A or S residue, X₈₃ is a T or D residue, X₈₄ is a S or E residue, X₈₅ is a T or E residue, X₈₆ is a V or P residue, X₈₇ is a T or H residue, X₈₈ is a H or R residue, X₈₉ is a L or R residue, X₉₀ is a L or N residue, X₉₁ is a V or T N residue, ₉₂ is an A or P residue, X₉₃ is a K or R residue, X₉₄ is an E or D residue, X₉₅ is a V or T residue, X₉₆ is a T or S residue, X₉₇ is an A or T residue, X₉₈ is a V or I residue, X₉₉ is a Q or D residue, X₁₀₀ is a S or N residue, X₁₀₁ is a S or G residue, X₁₀₂ is a S or D residue, X₁₀₃ is a T or N residue, X₁₀₄ is a T residue or represents a single residue deletion, X₁₀₅ is a L or T residue, X₁₀₆ is a L or I residue, X₁₀₇ is an A or T residue, X₁₀₈ is a K or R residue, X₁₀₉ is a L or V residue, X₁₀₀ is a K or R residue, X₁₁₁ is a T or Q residue, X₁₁₂ is a V or N residue, X₁₁₃ is a V, T, or S residue, X₁₁₄ is a K or S residue, X₁₁₅ is a K or Q residue, X₁₁₆ is a L or I residue, X₁₁₇ is a V or E residue, X₁₁₈ is a V or I residue, X₁₁₉ is a V or I residue, X₁₂₀ is a L or V residue, X₁₂₁ is a V or I residue, X₁₂₂ is a K, R, or Q residue, X₁₂₃ is a K, Q, or E residue, X₁₂₄ is a K or E residue, X₁₂₅ is a T, E, or D residue, X₁₂₆ is a G or D residue, X₁₂₇ is an A or S residue, X₁₂₈ is a L or F residue, X₁₂₉ is a S or E residue, X₁₃₀ is a T, I, or S residue, X₁₃₁ is a Q, E, or M residue, X₁₃₂ is a L, G, or E residue, X₁₃₃ is a K, Q, or E residue, X₁₃₄ is a K or C residue, X₁₃₅ is a V, K, or R residue, X₁₃₆ is an A, T, or H residue, X₁₃₇ is a V, H, or Q residue, X₁₃₈ is a L or N residue, X₁₃₉ is an A or V residue, X₁₄₀ is a V or I residue, X₁₄₀ is a T or I residue, X₁₄₂ is a K, E, or D residue, X₁₄₃ is an E or D residue, X₁₄₄ is a K, S, or E residue, X₁₄₅ is an A or L residue, X₁₄₆ is an E or D residue, X₁₄₇ is a Y or N residue, X₁₄₈ is a V or I residue, X₁₄₉ is a V or I residue, X₁₅₀ is a A, V, or I residue, X₁₅₁ is a G or E residue, X₁₅₂ is a E, P, or D residue, X₁₅₃ is a K, S, or C residue, X₁₅₄ is an A or Q residue, X₁₅₅ is a K or T residue, X₁₅₆ is a L or I residue, X₁₅₇ is a S, H, or N residue, X₁₅₈ is a F or Y residue, X₁₅₉ is a K or R residue, X₁₆₀ is a K or Q residue, X₁₆₀ is an A, T, or S residue, X₁₆₂ is an A, Y, or M residue, X₁₆₃ is a K or R residue, X₁₆₄, is an E or D residue, X₁₆₅ is a L or V residue, X₁₆₆ is a L or V residue, X₁₆₇ is a L or F residue, X₁₆₈ is a L or I residue, X₁₆₉ is an A or M residue, X₁₇₀ is a L or V residue, X₁₇₁ is a V or I residue, X₁₇₂ is an A or S residue, X₁₇₃ is a V or I residue, X₁₇₄ is a T or C residue, X₁₇₅ is an A or T residue, X₁₇₆ is a M residue or represents a single residue deletion, X₁₇₇ is a F residue or represents a single residue deletion, and X₁₇₈ is a G residue or represents a single residue deletion, 51. SEQUENCE M X₁ X₂ X₃ F I L X₄ M L V X₅ P S X₆ M₇ M R C X₈ G X₉ X₁₀ N X₁₁ PATTERN 2 D F V E G X₂ S G X₁₃ X₁₄ W V D X₁₅ VLE H G X₁₆ C V T T M A X₁₇ X₁₈ K P T L D X₁₉ E L X₂₀ K T X₂₁ x₂₂ X₂₃ X₂₄ X₂₅ A X₂₆ L R X₂₇ X₂₈ C I E A X₂₉ X₃₀ X₃₁ N X₃₂ T T X₃₃ X₃₄ R C P T Q C E X₃₅ X₃₆ X₃₇ X₃₈ E E Q D X₃₉ X₄₀ X₄₁ X₄₂ C X₄₃ X₄₄ X₄₅ X₄₆ V D R G W G N G C G L F G K G X₄₇ X₄₈ X₄₉ T C A X₅₀ F X₅₁ C X₅₂ X₅₃ X₅₄ X₅₅ E G X₅₆ X₅₇, V Q X₅₈ E N L X₅₉ Y X₆₀ X₆₁ X₆₂ X₆₃ T X₆₄ H X₆₅ G X₆₆ X₆₇ H X₆₈ V G N X₆₉ T X₇₀ X₇₁ X₇₂ G X₇₃ X₇₄ X₇₅ X₇₆ I T P Q X₇₇ X₇₈ X₇₉ X₈₀ E X₈₁ X₈₂ L X₈₃ X₈₄ Y G X₈₅ X₈₆ X₈₇ X₈₈ X₈₉ C S P R T G L D F N X₉₀ X₉₁ X₉₂ L X₉₃ X₉₄ M K X₉₅ K X₉₆ W X₉₇ V H X₉₈ Q W X₉₉ X₁₀₀ D L P L P W T X₁₀₁ G A X_(102 T X) ₁₀₃ X₁₀₄ X₁₀₅ X₁₀₆ W N X₁₀₇ K E X₁₀₈ X₁₀₉ V T F K X₁₁₀ X₁₁₁ H A K X₁₁₂ Q X₁₁₃ V V₁₁₄ V L G S Q E G X₁₁₅ M H X₁₁₆ A L X₁₁₇ C X₁₁₈ T E X₁₁₉ X₁₂₀ X₁₂₁ X₁₂₂ X₁₂₃ C X₁₂₄ T X₁₂₅ X₁₂₆ F X₁₂₇ G X₁₂₈ L K C X₁₂₉ X₁₃₀ X₁₃₁ M X₁₃₂ K L X₁₃₃ X₃₄ K G X₁₃₅ S Y X₁₃₆ M C T G X₁₃₇ F X₁₃₈ X₁₃₉ X₁₄₀ K E X₁₄₁ AN E T Q H C T X₁₄₂ X₁₄₃ X₁₄₄ X₁₄₅ V X₁₄₆ Y X₁₄₇ C X₁₄₈ X₁₄₉ X₁₅₀ P C K I P X₁₅₁ X₁₅₂ X₁₅₃ X₁₅₄ D X₁₅₅ X₁₅₆ X₁₅₇ X₁₅₈ X₁₅₉ X₁₆₀ X₁₆₁ G R L I T X₁₆₂ N P X₁₆₃ V X₁₆₄ X₁₆₅ K X₁₆₆ X₁₆₇ P V N I E X₁₆₈ E P P F G X₁₆₉ S X₁₇₀ I X₁₇₁ X₁₇₂ G X₁₇₃ X₁₇₄ X₁₇₅ X₁₇₆ X₁₇₇ L X₁₇₈ X₁₇₉ X₁₈₀ W X₁₈₁ X₁₈₂ K G S S I G X₁₈₃ M F E X₁₈₄ T X₁₈₅ R C A X₁₈₆ R M AN I L G X₁₈₇ T A W D X₁₈₈ G S X₁₈₉ X₁₉₀ X₁₉₁ X₁₉₂ X₁₉₃ X₁₉₄ X₁₉₅ X₁₉₆ X₁₉₇ X₁₉₈ X₁₉₉ X₁₂₀ X₂₀₁ X₂₀₂ X₂₀₃ X₂₀₄ X₂₀₅ X₂₀₆ X₂₀₇ X₂₀₈ X₂₀₉ X₂₁₀ X₂₁₁ X₂₁₂ X₂₁₃ wherein X₁ is an A, V, T, or G residue, X₂ is a V or I residue, X₃ is an I or F residue, X₄ is an L or M residue, X₅ is an A or T residue, X₆ is a Y or M residue, X₇ is an A, T, or G residue, X₈ is a V or I residue, X₉ is a V, I, or T residue, X₁₀ is a S or C residue, X₁₁ is a R or G residue, X₁₂ is a L or V residue, X₁₃ is an A or C residue, X₁₄ is an A, T, or S residue, X₁₅ is a L or V residue, X₁₆ is a S or C residue, X₁₇ is a K, R, or Q residue, X₁₈ is a G or N residue, X₁₉ is an I or F residue, X₂₀ is a L, I, or Q residue X₂₁ is a T, I, or E residue, X₂₂ is an A or V residue, X₂₃ is a K or T residue, X₂₄ is a Q, E, or N residue, X₂₅ is an L, V, or P residue, X₂₆ is a L, V, or T residue, X₂₇ is a K or T residue, X₂₈ is a L or Y residue, X₂₉ is a K or S residue, X₃₀ is a L or I residue, X₃₁ is a T or S residue, X₃₂ is an I or T residue, X₃₃ is an A, E residue, or D residue, X₃₄ is a Tor S residue, X₃₅ is an A or P residue, X₃₆ is an I, T, Y, or N residue, X₃₇ is a L or M residue, X₃₈ is a K, V, or P residue, X₃₉ is a T or Q residue, X₄₀ is a Q or N residue, X₄₁ is a F or Y residue, X₄₂ is a V or I residue, X₄₃ is a K or R residue, X₄₄ is a H or R residue, X₄₅ is a T, S, or D residue, X₄₆ is a V, F, Y, or M residue, X₄₇ is a S or G residue, X₄₈ is a L, V, or I residue, X₄₉ is a V or I residue, X₅₀ is a K or M residue, X₅₁ is a K, T, or Q residue, X₅₂ is a L, V, or K residue, X₅₃ is a K, T, or E residue, X₅₄ is a K, F, or N residue, X₅₅ is a L I N residue, X₅₆ is a K or N residue, X₅₇ is a V I residue, X₅₈ is a H, P, or Y residue, X₅₉ is a K or E residue, X₁₆₀ is a T S residue, X₁₆₁ is a V I residue, X₁₆₂ is a V or I residue, X₁₆₃ is a V or I residue, X₁₆₄ is a V or P residue, X₁₆₅ is a T or S residue, X₁₆₆ is an E or D residue, X₁₆₇ is a Q or E residue, X₁₆₈ is an A or Q residue, X₁₆₉ is an E or D residue, X₁₇₀ is a T, S, or G residue or represents a single residue deletion, X₇₁ is a K E N or represents a single residue deletion, X₇₂ is a H or Q residue, X₇₃ is a V, K, or T residue, X₇₄ is a T, I, or E residue, X₇₅ is an A, V, or I residue, X₇₆ is a K, T, or E residue, X₇₇ is an A or S residue, X₇₈ is a S or P residue, X₇₉ is a T or I residue, X₈₀ is a V, T, or S residue, X₈₁ is an A or I residue, X₈₂ is an I, Q, or E residue, X₈₃ is a T or P residue, X₈₄ is a G, E, or D residue, X₈₅ is an A or T residue, X₈₆ is a L or V residue, X₈₇ is a T or G residue, X₈₈ is a L or M residue, X₈₉ is an E or D residue, X₉₀ is a R or E residue, X₉₁ is a V or M residue, X₉₂ is a V or I residue, X₉₃ is a L or N residue, X₉₄ is a K or T residue, X₉₅ is a K, S, or N residue, X₉₆ is an A, T, or S residue, X₉₇ is a L or N residue, X₉₈ is a K, R, or G residue, X₉₉ is a L or F residue, X₁₀₀ is a L or F residue, X₁₀₁ is an A or S residue, X₁₀₂ is a T, S, or D residue, X₁₀₃ is an S or E residue, X₁₀₄ is a V, T, Q, or E residue, X₁₀₅ is a V, H, E, or P residue, X₁₀₆ is a T or H residue or represents a single residue deletion, X₁₀₇ is a H or R residue, X₁₀₈ is a L or R residue, X₁₀₉ is a L or M residue, X₁₀₀ is a V, T, or N residue, X₁₁₁ is an A or P residue, X₁₁₂ is a K or R residue, X₁₁₃ is an E or D residue, X₁₁₄ is a V or T residue, X₁₁₅ is an A T residue, X₁₁₆ is a T S residue, X₁₁₇ is an A or T residue, X₁₁₈ is an A or T residue, X₁₁₉ is a V or I residue, X₁₂₀ is a Q or D residue, X₁₂₁ is a T, S, N, or M residue, X₁₂₂ is a S or G residue, X₁₂₃ is a S or D residue, X₁₂₄ is a T or N residue, X₁₂₅ is a L, T, or H residue, X₂₆ is a L, I, or M residue, X₁₂₇ is an A or T residue, X₁₂₈ is a H or R residue, X₁₂₉ is a K or R residue, X₁₃₀ is a L or V residue, X₁₃₁ is a K or R residue, X₁₃₂ is an E or D residue, X₁₃₃ is a T, R, or Q residue, X₁₃₄ is a L or I residue, X₁₃₅ is a V or M residue, X₁₃₆ is a V, T, or S residue, X₁₃₇ is a K or S residue, X₁₃₈ is a K or Q residue, X₁₃₉ is a L or I residue, X₁₄₀ is a V or E residue, X₁₄₁ is a V or I residue, X₁₄₂ is a V or I residue, X₄ is a L or V residue, X₁₄₄ is a V or I residue, X₁₄₅ is a K, R, or Q residue, X₁₄₆ is a K, Q, or E residue, X₁₄₇ is a K or E residue, X₁₄₈ is a T, E, or D residue, X₁₄₉ is a C or D residue, X₁₅₀ is an A or S residue, X₁₅₁ is a L, V, or F residue, X₁₅₂ is a S or E residue, X₁₅₃ is an I, T, or S residue, X₁₅₄ is a Q, E, or M residue, X₁₅₅ is a L, C, or E residue, X₁₅₆ is a K, Q, or E residue, X₁₅₇ is a K or C residue, X₁₅₈ is a K, V, or R residue, X₁₅₉ is an A, T, or H residue, X₁₆₀ is a V, H, or Q residue, X₁₆₁ is a L or N residue, X₁₆₂ is an A or V residue, X₁₆₃ is an A, V, or I residue, X₁₆₄ is an I or T residue, X₁₆₅ is a K, E, or D residue, X₁₆₆ is an E or D residue, X₁₆₇ -, is a K, S, or E residue, X₁₈₆ is a L or A residue, X₁₆₉ is an E or D residue, X₁₇₀ is a Y or N residue, X₁₇₁ is a V or I residue, X₁₇₂ is a V or I residue, X₁₇₃ is an A, V, or I residue, X₁₇₄ is a C or E residue, X₁₇₅ is an E, P, or D residue, X₁₇₆ is a K, S, or G residue, X₁₇₇ is an A or Q residue, X₁₇₈ is a K or T residue, X₁₇₉ is a L or I residue, X₁₈₀ is a S, H, or N residue, X₁₈₁ is a F or Y residue, X₁₈₂ is a K or R residue, X₁₈₃ is a K or Q residue, X₁₈₄ is an A, T, or S residue, X₁₈₅ is an A, Y, or M residue, X₁₈₆ is a K or R residue, X₁₈₇ is an E or D residue, X₁₈₈ is a L or F residue, X₁₈₉ is an A, L, V, or I residue, X₁₉₀ is a G or Y residue, X₁₉₁ is a T or C residue, X₁₉₂ is a L or V residue or represents a single residue deletion, X₁₉₃ is a L or F residue or represents a single residue deletion, X₁₉₄ is a T or N residue or represents a single residue deletion, X₁₉₅ is a S residue or represents a single residue deletion, X₁₉₆ is a L, V, or I residue or represents a single residue deletion, X₁₉₇ is a C residue or represents a single residue deletion, X₁₉₈ is a K residue or represents a single residue deletion, X₁₉₉ is an A or M residue or represents a single residue deletion, X₂₀₀ is a L or V residue or represents a single residue deletion, X₂₀₁ is a H residue or represents a single residue deletion, X₂₀₂ is a Q residue or represents a single residue deletion, X₂₀₃ is a V or I residue or represents a single residue deletion, X₂₀₄ is a F residue or represents a single residue deletion, X₂₀₅ is a G residue or represents a single residue deletion, X₂₀₆ is an A or S residue or represents a single residue deletion, X₂₀₇ is a V or I residue or represents a single residue deletion, X₂₀₈ is a F or Y residue or represents a single residue deletion, X₂₀₉ is a T or C residue or represents a single residue deletion, X₂₁₀ is an A, T, or S residue or represents a single residue deletion X₂₁₁ is a V or M residue or represents a single residue deletion, X₂₁₂ is a G or F residue or represents a single residue deletion, and X₂₁₃ is a K or G residue or represents a single residue deletion, 52. DEN-1 MGIIFILLMLVTPSMA PRM15 53. DEN-2 MGLILILQTAVAPSMT PRM15 54. DEN-3 MVVIFILLMLVTPSMT PRM15 55. DEN-4 MTVFFILMMLVAPSYG PRM15 56. 6C6 PRM15 MTVFFTLNMLVTPSMA 57. PRM15 6F4 MAVFFILLMLVTPSMT 58. PRM15 7A9 MTVFFTLMMLVAPSYA 59. PRM15 MTVFFILLMLVAPSYG 11E2 60. PRM15 MAVFFTLMMLVAPSYG 61. 12E3 PRM15 MGIIFILLMLVTPSYG 62. 15C2 PRM15 MTVFFILMMLVTPSMT 63. 17A12 PRM15 MTVFFILLMLVTPSMT 64. 17D5 PRM15 MTVFFILMMLVAPSMA 65. 17G5 2/7 MVVIFTLLMLVTPSMTMRCIGISNRDFVEGVSGATWVDVVLEHGSCVTTMAKNKPTLD IELIKTEATQPATLRKYCIEAKLTNTTTESRCPTQGEPYLKEEQDQNYVCKHTYVDRG WGNGCGLFGKGSLITCAKFKCVTKLEGKIVQYENLKYSVIVTVRTGDQHQVGNETTEH GTIATITPQAPTSETQLTDYGALTLDCSPRTGLDFNRVVLLTMKKKTWLVHKQWFLDL PLPWTAGASTSQETWNRKELLVTFKNAHAKKQEVVVLGSQEGAMHTALTGATEVDSGD GNLLFTGHLKCRLKMDKLQLKGMSYSMCTGKFQIVKEIAETQHGTIVIRVQYEGDGAP CKIPFSTEDGQGKAHNGRLITANPIVIDKEKPVNTELEPPFGDSYIVVGAGDKALKTN WYKKGSSIGKMFESTYRGAKRMAILGETAWDFGSVGGVFTSIGKALHQVFGAIYGA 66. 2G11 MTVFFILLMLVTPSMAMRCVGIGNRDFVEGLSGATWVDVVLEHGGCVTTMAKNKPTLD FELIKTTAKEVALLRTYCIEASISNITTATRCPTQGEPYLKEEQDQQYICRRDVVDRG WGNGCGLFCKGSLITCAKFKCVTKLEGNIVQPENLEYTIVITPHTGDQHQVGNDTQGV TVETTPQASTVEAILPEYGTLGLECSPRTCLDFNEMILLTMKNKAWMVHRQWFFDLPL PWTSGATTETPTWNRKELLVTFKNAHAKKQEVVVLGSQEGAMHSALAGATEIQMSSGN LLFTGHLKCRLKMDKLQLKGMSYTMCTGKFQIVKETAETQHGTTLIKVEYKCEDAPCK IPFSTEDEKGVTQNGRLITANPTVTEKDSPVNIEAEPPFGESYTVVGACEKALKLSWF KKGSSIGQMFESTYRGAKRMAILGETAWDFCSVGGLLTSLGKAVHQVFCSVYTTMFG 67. 5/21 MTVFFILMMLVAPSYGMRCVGVGNRDFVEGLSGATWVDVVLEHGSCVTTMAKNKPTLD IELLKTEVTNPATLRKYCIEAKITNITTDSRCPTQGEAILPEEQDQNYVCKHTYVDRC WGNGCGLFGKCSLVTCAKFTCKKNMEGNIVQPENLEYTIVITPHSGEEHAVCNDTGKH GKETKTTPQSSITEAELTGYGTVTMECSPRTGLDFNRVVLLTMKKKSWLVHKQWFLDL PLPWTAGADTSEVHWNHKERMVTFKVPHAKRQDVTVLGSQEGAMHSALAGATEIQMSS GNLLFTGHLKCRLRMDKLQLKGMSYVMCTGKFQIVKEIAETQHGTTVIRVQYEGDGSP CKIPLEIMDLEKRHVLGRLITVNPIVIDKEKPVNIEAEPPFGESYIVVGVGDSALTLH WFRKGSSIGQMFETTNRGAKRMAILGDTAWDFGSLGGVPTSIGKALHQVFGATYGA 68. 6C6 MTVFFTLMMLVTPSMAMRCVGIGNRDFVEGVSGGAWVDLVLEHGGCVTTMAKNKPTLD FELIKTTAKEVALLRTYCIEASISNITTATRCPTQGEPYLKEEQDQQYICRRDVVDRC WGNGCGLFGKGGTVTCAMFTCKKN4EGNIVQPENLEYTIVITPHSGEEHAVGNDTGKH GKEIKITPQSSTTEAELTGYGTVTMECSPRTGLDFNEMILMKMKNKAWMVIRQWFLDL PLPWTAGADTETPTWNRKELLVTFKNAHAKKQEVVVLGSQEGAMHSALAGATEVDSGD GNHMFAGHLKCKVRMDKLQLKGMSYSMCTGKFQIVKEIAETQHGTIVIRVQYEGDGSP CKTPLEIMDLEKRHVLGRLTTVNPIVTEKDSPVNIEAEPPFGDSYIITGVEPGQLKLH WFKKGSSIGKMFEATARGARPMAILGDTAWDFGSAYT 69. 6E12 MTVFFILMMLVAPSYGMRCVGVGNRDFVEGLSGATWVDVVLEHGSCVTTMAKNKPTLD IELLKTEVTNPAVLRKLCIEASISNITTATRCPTQGEATLPEEQDQNYVCKHTYVDRG WGNGCGLFGKGSVVTCAKFQCLEPIEGKVVQHENLKYTVIITVJTGDQHQVGNDTQGV TVEITPQASTVEAILPEYGTLGLECSPRTGLDFNEMILLTMKNKAWMVHRQWFFDLPL PWTAGADTSEVHWNHKERMVTFKTAHAKKQDVVVLGSQEGAMHTALTGATEIQMSSGT TTIFAGHLKCKVRMDKLTLKGVSYVMCTGSFKLEKEVAETQHGTVLVQVKYEGTDAPC KIPFSSQDEKGVTQNGRLITANPIVIDKEKPVNIEAEPPFGDSYIIIGVEPGQLKLHW FKKGSSIGQMFETTMRGAKRMAILGDTAWDFGSVGGLLTSLGKMVHQTPGSVYTTMFG 70. 6F4 MAVFFILLMLVTPSMTMRCVGIGNRDFVEGVSGGAWVDLVLEHGGCVTTMAKIKPTLD FELTKTEATQPATLRKYCIEAKLTNTTTESRCPTQGEATLPEEQDQQYICRRDVVDRG WGNGCGLFGKGSLITCAKFQCLEPIEGKVVQHENLKYTVIITVITGDQHQVGNDTQGV TVETTPQASTVEAILPEYGTLGLECSPRTGLDFNEMILLTMKKKSWLVHKQWFLDLPL PWTAGADTSEVHWNHKERMVTFKVPHAKRQDVTVLGSQEGAMHSALAGATEVDSGDGN HMFAGHLKCKVRMEKLRIKGMSYSMCTGKFQIVKEIAETQHGTIVIRVQYEGDGSPCK IPLEIMDLEKRHVLGRLITVNPIVTEKDSPVNIEAEPPFGDSYIIIGVEPGQLKLHWF KKGSSIGQMFEATARGARPMAILGDTAWDFGSVGGVFTSTGKALHQVFGAIYGA 71. 7A9 MTVPFILMMLVAPSYAMRCVGIGNRDFVEGVSGATWVDVVLEHGGCVTTMAKNKPTLD IELQKTEATQLATLRKLCIEASISIITTATRCPTQGEATLVEEQDTNPVCRRTFTDRG WGNGCGLFGKGSLVTCAKFQCLEPIEGKVVQHENLKYTIVITPHSGEEHAVGNDTGKH GKEIKITPQSSTTEAELTGYGTVTMECSPRTGLDFNEMILLTMKSKTWLVHKQWFLDL PLPWTAGADTSEVHWNHKERMVTFKNAHAKKQEVVVLCSQEGANITALTGTTEIQNSG NLLFTCHLKCRLRIMDKLQLKGMSYSMCTGKFQTVKEIAETQHGTIVMVQYEGDGSPC KIPLEIMDLEKRHVLGRLITVNPIVTEKDSPVNIEAEPPFGDSYIIIGVEPGQLKLHW FRKGSSIGKMFESTYRGAKRMAILGETAWDFGSVGGLLTSLGKAVHQVFGSVYTTMFG 72. 11B1 MVVIFILLMLVTPSMTMRCIGISNRDFVEGVSGATWVDVVLEHGSCVTTMAKNKPTLD IELIKTEATQPATLRKYCIEAKLTNTTTATRCPTQGEAILPEEQDQNYVCKHTYVDRG WGNGCGLFGKGSVVTCAKFQCLEPIEGKVVQHENLKYTVIITVHTGDQHQVGNDTQGV TVEITPQASTVEAILPEYGTLGLECSPRTGLDFNEMTLLTIKNKAWMVHRQWFFDLPL PWTSGATTETPTWNHKERIVTFKVPHAKRQDVTVLGSQEGAMHTALTGATEVDSGDGN HMFAGHLKCRLKMDKLQLKGMSYSMCTGKFQIVKEIAETQHGTIVIRVQYEGDGSPCK IPLEIMDLEKRHVLGRLITANPIVTEKDSPVNIEAEPPFGDSYIIIGVEPGQLKLHWF RKGSSIGQMIFESTYRGAKRMAILGETAWDFGSVGGLLTSLGKAVHQVFGAIYGA 73. 11B8 MTVFFILLMLVAPSYGMRCVGVGNRDFVEGLSGATWVDVVLEHGSCVTTMAKNKPTLD IELLKTEVTNPATLRKYCIEAKLTNTTTESRCPTQCEPYLKEEQDQNYVCKHTYVDRG WGNGCCLFGKGSVVTCAKFQCLEPIEGKVVQHENLKYTVIITVHTGDQHQVGNDTQCV TVEITPQASTVEAILPEYGTLGLECSPRTGLDFNRVVLLTMKKKTWLTHKQWFLDLPL PWTAGADTETPTWNRKELLVTFKNAHAKKQEVVVLGSQEGAMHSALAGATEIQMSSGN HMFAGHLKCKVRIMEKLQLKGMSYVMCTGKFQIVKEIAETQHGTIVMVQYEGDGAPCK IPFSTEDGQGKAHNCRLITANPIVIDKEKPVNIELEPPFGDSYIVVGAGDKALKINWY KKGSSIGQMFEATARGARRMAILGDTAWDFGSVGGLLTSLGKMVHQIFGSVYTTMFG 74. 11C4 MTVFFILMMLVAPSYGMRCVGIGNRDFVEGLSGATWVDVVLEHGSCVTTMAKNKPTLD IELLKTEVTNPATLRTYCTEASISNITTESRCPTQGEPYLKEEQDQNYVCKHTYVDRG WGNGCGLFGKGSVVTCAKFQCLEPIEGKWQHENLKYTTVITPHSGEEHAVGNDTGKIT GKETKITPQSSITEAELTGYGTVTMECSPRTGLDFNEMILLTMKSKTWLVHKQWFLDL PLPWTAGADTSQETWNRKELLVTFKNAHAKKQEVVVLGSQEGAMHTALTGATEVDSGD GNLLFTGHLKCRLKMDKLQLKGMSYSMCTGKFQIVKEIAETQHGTIVIRVQYEGDGSP CKIPLETMDLEKRIVLGRLTTVNPIVTEKDSPVNTEAEPPFGDSYIIIGVGDKALKLH WPKKGSSIGKMFEATARGAKPMAILGETAWDFGSAYT 75. 11C11 MTVFFILMMLVAPSYGMRCVGVGNRDFVEGLSGATWVDVVLEHGSCVTTMAKNKPTLD IELLKTEVTNPAVLRKLCIEASISNTTTESRCPTQGEPYLKEEQDQNYVCKHTYVDRG WGNGCGLFGKGSLITCAKFKCVTKLEGKIVQYENLKYSVIVTVHTGDQHQVGNETQGV TVETTPQASTVEATLPEYGTLGLECSPRTGLDFNEMILLTMKKKTWLVHKQWFLDLPL PWTAGADTSQETWNRKELLVTFKNAHAKKQEVVVLGSQEGAMHTALTGATEVDSGDGN LLFTGHLKCRLKMDKLQLKGMSYSMCTGKFQIVKETAETQHGTIVIRVQYECEDAPCK IPFSTEDGQGKAHNGRLITANPIVTEKDSPVNTEAEPPFGDSYIVVGAGEKALKLIWF KKGSSIGQMFEATARGARPMAILGDTAWDFGSVGGVFTSICKALHQVFGATYGA 76. 11E2 MTVFFILLMLVAPSYGMRCVGVCNRDFVEGLSGATWVDVVLEHGSCVTTMAKNKPTLD IELLKTEVTNPAVLRKLCIEASISNITTATRCPTQGEAILPEEQDQNYVCKHTYVDRG WGNGCGLFGKCSLVTCAKFKCVTKLEGKTVQYENLKYSVIVTVHTGDQHQVGNETTEH GTIATTTPQAPTSEIQLTDYGALTLDCSPRTGLDFNRVVLLTMKKKTWLVHKQWFLDL PLPWTAGASTSQETWNRKELLVTFKNAHAKRQDVTVLGSQEGAMHSALTGATEIQMSS GNLLFAGHLKCRLKMDKLQLKGMSYSMCTCKFQIVKETAETQHGTIVTRVQYEGDGAP CKIPLEIMDLEKRHVLGRLITVNPIVTEKDSPVNIEAEPPFGESYIVVGVGDSALTLH WFRKGSSICQMFEATAPGAKRMAILGDTAWDPGSVGGVFTSIGKALHQVFGAIYCA 77. 12E3 MAVFFILMMLVAPSYGMRCVGVGNRDFVEGLSGATWVDVVLEHGSCVTTMAKNKPTLD IELIKTTAKEVALLRTYCIEASISNTTTATRCPTQGEAILPEEQDQNYVCKHTYVDRG WGNGCGLFGKGSLVTCAKFTCKKNMEGNIVQPENLEYTIVITPHSGEEHAVGNDTGKH GKEIKITPQSSITEAELTGYGTVTMECSPRTGLDFNRVVLLTMKKKSWLVHKQWFLDL PLPWTAGASTSQETWNRKELLVTFKNAHAKKQEVVVLGSQEGAIHTALTGATETQMSS GNLLFTGHLKCRLRMDKLQLKGMSYSMCTGKFQIVKEIAETQHGTIVIRVQYEGDGAP CKIPFSTEDGQGKAHNGRLTTANPIVIDKEKPVNIELEPPFGESYIVVGAGDKALKIN WYKKGSSIGKMFESTYRGAKRMAILGETAWDFGSVGGLLTSLGKAVHQVFGSVYTTMF G 78. 12H4 MTTFFILMMLVAPSYAMRCIGISNRDFVEGVSGATWVDVTLEHGSCVTTMAKNKPTLD IELTKTEATQPATLRKYCIEASISNITTDSRCPTQGEPYLKEEQDQQYICRRDVVDRG WCNGCGLFGKGSLITCAKFKCVTKLEGKIVQYENLKYSVIVTVHTGDQHQVGNDTQGV TVEITPQASTVEATLPEYGTLGLECSPRTGLDFNEMILLTMKKKSWLVHKQWFLDLPL PLPWTAGASTSQETWNRKELLVTFKNAHAKKQEVVVLGSQEGAMHTALTGATEIQMSS LLFTGHLKCRLKMDKLQLKGMSYSMCTGKFQIVKEIAETQHGTIVIRVQYEGDGAPCK IPFSTEDEKGVTQNGRLITANPIVTEKDSPVNIEAEPPFGDSYIIIGVEPGQLKLIWF KKGSSIGQMFEATARGARRMAILGDTAWDFGSAYT 79. 13E2 MTVFFILMMLVTPSMAMRCVGIGNRDFVEGVSGATWVDVVLEHGGCVTTMAKNKPTLD FELTKTTAKEVALLRTYCIEASISNITTATRCPTQGEPYLKEEQDQQYICRRDVVDRG WGNGCGLFCKCSLITCAKFKCVTKLECNIVQPENLEYTIVITPHSGEEHAVGNDTGKH GKEIKITPQSSITEAELTGYGTVTMECSPRTGLDFNEMILLTMKKKSWLVHKQWFLDL PLPWTAGASTSQETWNRKELLVTFKNAHAKKQEVVVLGSQEGANITALTGATEIQMSS GNLLFTGHLKCRLRMDKLQLKGMSYTMCTGKFQIVKEIAETQHGTIVIRVQYEGTDAP CKIPFSTEDGQGKAHNGRLITANPIVIDKEKPVNIELEPPFGDSYTIIGVEPGQLKLH WFKKGSSIGKMFEATARGAKRMAILGETAWDFGSVGGLLTSLGKMVHQIFGSVYTTMF G 80. 13E11 MTVFFILMMLVTPSMAMRCVGVGNRDFVEGVSGGAWVDVVLEHGGCVTTMAKNKPTLD IELIKTEATQPATLRKYCIEASISNITTDTRCPTQGEPYLKEEQDQNYVCKITYVDRG WGNGCGLFGKGSLITCAKFKCVTKLEGKIVQYENLKYSVIVTVHTGDQHQVGNETTEH GTIATITPQAPTSEIQLTDYGTVTMECSPRTGLDFNEMILLTMKSKTWLVHKQWFLDL PLPWTAGADTETPTWNRKELLVTFKNAHAKKQEVVVLGSQEGAMHTALAGATEIQMSS GNLLFTGHLKCRLKMDKLQLKGMSYTMCTGKFQIVKEIAETQHGTIVIRVQYEGTDAP CKIPPSTEDGQGKAHNGRLITANPIVIDKEKPVNTELEPPFGESYIVVGAGEKALTLH WFRKGSSIGQMFETTMRGAKRMAILGETAWDFGSVGGVFTSIGKALHQVFGAIYGA 81. 13F11 MTVFFILMMLVTPSMAMRCVGTGNRDFVEGVSGGAWVDLVLEHGGCVTTMAKNKPTLD IELIKTEATQPATLRKYCIEAKITNITTDSRCPTQGEPYLKEEQDQQYTCRRDVVDRG WGNGCGLFGKGSLITCAKFKCVTKLEGKIVQYENLKYSVTVTVHTGDQHQVGNETTEH GTIATITPQAPTSEIQLTDYGALTLDCSPRTGLDFNRVVLLTMKKKTWLVHKQWFLDL PLPWTAGADTSEVHWNHKERMVTFKVPHAKRQDVTVLGSQEGAIHSALAGATEVDSGD GNHMFAGHLKCRLKMDKLQLKGMSYSMCTGKFQTVKEIAETQHGTIVIRVQYEGDGAP CKIPFSTEDEKGKAHNGRLITANPIVIDKEKPVNIELEPPFGESYIVVGAGEKALKLS 82. 14B1 WFKKGSSIGKMFESTYRGAKRMAILGDTAWDFGSVGGVFTSIGKALHQVFGAIYGA MTVFFTLMMLVAPSYGMRCVGISNRDFVEGVSGGAWVDLVLEHGGCVTTMAKNKPTLD IELLKTEVTNPATLRKYCIEAKITNITTDSRCPTQGEPYLKEEQDQQYICRRDVVDRG WGNGCGLFGKGSLITCAKFKCVTKLEGNIVQPENLKYTIVITPHTGDQHQVGNDTQGV TVEITPQASTVEAILPEYGTLGLECSPRTGLDFNEMILLTNKKKSWLVHKQWFLDLPL PWTAGADTETPTWNRKELLVTFKNAHAKRQDVTVLGSQEGAMHSALTGATEVDSGDGN LLFTGHLKCRLKMDKLQLKGMSYTMCTGKFQIVKEIAETQHGTTVTRVQYEGDGAPCK IPFSTEDEKGVTQNGRLITANPIVTEKDSPVNIEAEPPFGESYIVVGAGEKALKLSWF KKGSSIGQMFESTARGARRMAILGDTAWDFGSAYT 83. 14E9 MTVFFILLMLVTPSMAMRCVGIGNRDFVEGVSGGAWVDLVLEHGGCVTTMAKNKPTLD IELIKTEATQPATLRKYCIEAKITNITTDSRCPTQGEAILPEEQDQQYICRRDVVDRG WGNGCGLFGKGSVVTCAKFQCVTKLEGNIVQPENLEYTIVITPHTGDQHQVGNDTQGV TVEITPQASTVEAILPEYGTLGLECSPRTGLDFNEMILMKMKNKAWMVHKQWFLDLPL PWTSGASTSQETWNRKELLVTFKNAHAKKQEVVVLGSQEGAMHSALAGATEIQMSSGN LLFTGHLKCRLKMDKLQLKGMSYTMCTGKFQIVKEIAETQHGTIVIRVQYEGDGSPCK IPLEIMDLEKRHVLGRLITANPIVIDKEKPVNIELEPPFGDSYIVVGAGDKALKINWY KKGSSIGKMFESTYRGAKRMAILGETAWDFGSAYT 84. 14G10 MAVFFILLMLVTPSMAMRCIGISNRDFVEGVSGATWVDVVLEHGGCVTTMAKNKPTLD IELLKTEVTNPATLRKYCTEAKTTNITTDTRCPTQGEAILPEEQDQQYICRRDVTDRG WGNGCGLFGKGSLVTCAKFTCKKNNEGNIVQPENLEYTIVITPHSGEEHAVGNETTEH GTIATITPQAPTSEIQLTDYGALTLDCSPRTGLDFNRVVLLTMKKKTWLVHKQWFLDL PLPWTAGADTSEVHWNHKERMVTFKVPHAKRQDVTVLGSQEGAMHSALAGATEIQMSS GNLLFTGHLKCRLKMDKLQLKGMSYSMCTGKFQIVKEIAETQHGTIVIRVQYEGDGAP CKIPFSTEDGQGKAHNGRLITANPIVIDKEKPVNIELEPPFGDSYIVVGAGDKALKIN WYKKGSSIGKMFESTYRGAKRMAILGETAWDFGSVGGVFTSIGKALHQVFGAIYGA 85. 14H2 MTVFFILMMLVAPSYGMRCVGVGNRDFVEGVSGGAWVDLVLEHGGCVTTMAKNKPTLD FELIKTTAKEVALLRTYCIEAKITNITTATRCPTQGEPYLKEEQDQNYVCKHTYVDRG WGNGCGLFGKGSLITCAKFQCVTKLEGNIVQPENLEYTIVITPHSGEEHAVGNDTQGV TVEITPQASTVEAILTDYGALTLDCSPRTGLDFNRVVLLTMKKKTWLVHKQWFLDLPL PWTAGADTSEVHWNHKERMVTFKVPHAKKQEVVVLGSQEGAMHTALTGATEVDSGDGN LLFTGHLKCRLKMDKLQLKGMSYSMCTGKFQIVKEIAETQHGTIVIRVQYKGEDAPCK IPFSSQDGQGKAHNGRLITANPTVTEKDSPVNIEAEPPFGESYIVVGAGEKALKLSWF KKGSSIGQMFETTMRGAKRMAILGETAWDFGSVGGVFTSIGKALHQVFGAIYGA 86. 15C2 MVVIFILLMLVTPSMAMRCVGISNRDFVEGLSGATWVDVVLEHGSCVTTMAKNKPTLD IELLKTEVTNPATLRKYCIEAKITNITTDSRCPTQGEPYLKEEQDQQYICRRDVVDRG WGNGCGLFGKGSVVTCAKFQCLEPIEGKIVQYENLKYSVIVTVHTGDQHQVGNDTQGV TVEITPQASTVEAILPEYGTLGLECSPRTGLDFNEMTLLTMKNKAWMVHRQWFFDLPL PWTAGADTSEVHWNHKERMVTFKVPHAKRQDVTVLGSQEGAMHSALAGATEVDSGDGN HMFAGHLKCRLKMDKLQLKGMSYVMCTGKFQIVKEIAETQHGTIVIRVQYEGDGAPCK IPFSTEDEKGVTQNGRLITANPIVIDKEKPVNIEAEPPFGESYIVIGVEPGQLKLHWF KKGSSIGQMFEATARGARRMAILGETAWDFGSVGGLLTSLGKAVHQVFGSVYTTMFT 87. 15D4 MTVFFILLMLVTPSMAIRCVGIGNRDFVEGLSGATWVDVVLEHGGCVTTMAKNKPTLD FELIKTTAKEVALLRTYCIEAKITNITTDSRCPTQGEAILPEEQDQNYVCKHTYVDRG WGNGCGLFGKGSLITCAKFKCLEPIEGKVVQPENLEYTIVITPHSGEEHAVGNDTGKH GKEIKITPQSSITEAELTGYGTVTMECSPRTGLDFNRVVLLTMKKKSWLVHKQWFLDL PLPWTAGADTSEVHWNHKERMVTFKTAHAKKQDVVVLGSQEGAMHSALAGATEIQMSS GNLLFTGHLKCRLRMDKLTLKGVSYVMCTGSFKLEKEVAETQHGTVLVQVKYEGTDAP CKIPFSTEDEKGVTQNGRLITANPIVTEKDSPVNIEAEPPFGESYIVVGAGDKALKIN WYKKGSSIGKMFESTYRGAKRIVIAILGDTAWDFGSLGGVFTSIGKALHQTFGAIYGA 88. 15H4 MTVFFILMMLVAPSYGMPCVGVGNRDFVEGLSGATWVDVVLEHGSCVTTNAKNKPTLD FELTKTEVTNPATLRKYCIEASTSNITTATRCPTQGEANLPEEQDQNYVCKHTYTDRG WGNGCGLFGKGSLVTCAKFQCLEPIEGKVVQHENLKYTVIITVHTGDQHQVGNDTQGV TVETTPQASTVEAILPEYGTLGLECSPRTGLDFNRVVLLTMKKKSWLVHKQWFLDLPL PWTAGADTSEVHWNHKERIVTFKTAHAKKQEVVVLGSQEGAMHSALAGATEIQMSSGN LLFTGRLKCRLKMDKLQLKGMSYVMCTGKFQIVKEIAETQHGTIVIRVQYEGDGAPCK IPLEIMDLEKRHVLGRLITVNPTVIDKEKPVNIEAEPPFGDSYIIICVEPGQLKLHWF KKGSSIGQMFESTYRGAKRMAILGDTAWDFGSVGGVLNSLGKMVHQIFGSVYTTMFG 89. 16B4 MGIIFILLMLVTPSYGMRCVGVGNRDFVEGLSGATWVDVVLEHGSCVTTNAKNKPTLD IELLKTEVTNPAVLRKLCIEASISNITTDSRCPTQGEATLPEEQDQNYVCKHTYVDRG WGNGCGLFGKGSLITCAKFKCLEPIEGKVVQHENLKYTVIITVHTGDQHQVONDTQGV TVEITPQASTVEAILPEYGTLGLECSPRTGLDFNEMILLTMKKKTWLVHKQWFLDLPL PWTAGADTSEVHWNHKEPMVTFKTAHAKKQDVVVLGSQEGAMHTALTGATEIQMSSGT TTIFAGHLKCRLKNDKLQLKGMSYTMCTGKFQIVKEIAETQHGTILIKVEYKGEDAPC KIPFSTEDEKGVTQNGRLITANPIVTEKDSPVNIEAEPPFGESYIVVGVGDKALKINW YKKGSSIGQMFETTMRGAKRNAILGDTAWDFGSVGCLLTSLGKMVHQIFGSVYTTMFG 90. 16E8 MTVFFILMILVAPSYGMRCVGVGNRDFVEGLSGATWVDVVLEHGGCVTTMAKNKPTLD FELIKTTAKEVALLRTYCIEASTSNITTATRCPTQCEPYLKEEQDQQYICRRDVVDRG WGNGCOLFGKGSLITCAKFKCVTKLEGNIVQPENLEYTIVITPHTGDQHQVGNDTQGV TVEITPQASTVEAILPEYGTLGLECSPRTGLDFNEMILLTNKNKAWNVHKQWFLDLPL PWTSGATTETPTWNRKELLVTFKTAHAKKQDVVVLGSQEGAMHTALTGATEIQMSSON LLFSTEDGQGKAHNGRLITANPIVIDKEKPVNIEAEPPFGESNIVIGIGDKALKINWY IPFSTEDGQOKAHNGRLITANPIVTDKEKPVNIEAEPPFGESNIVIGIGDKALKINWY KKGSSIGKMFETTMRGAKRMAILGDTAWDFGSVGGVFTSIGKALIIQVFGAIYGA 91. 16E10 MTVFFILNMLVAPSYGMPCIGISNRDFVEGVSGATWVDVVLEHGSCVTTMAKNKPTLD IELLKTEVTNPATLRKYCIEAKISNTTTDSRCPTQGEATLVEEQDQNYVCKHTYVDRG WGNGCGLFGKGSLITCAKFKCVTKLEOKIVQYENLKYSVIITVHTGDQHQVGNDTQOV TVEITPQASTVEAILPEYGALGLECSPRTGLDFNEMTLLTMKNKAWMVHGQWFFDLPL PWTSGATTETPTWNRKELLVTFKNAHAKKQEVVVLGSQEGAMHSALAGATEIQMSSGN LLFTGHLKCRLRMDKLQLKGMSYSMCTGKFQIVKEIAETQHGTIVIRVQYEGDGSPCK IPLEIMDLEKRHVLGRLITVNPIVIDKEKPVNIEAEPPFGESYIVVGVODSALTLHWF RKGSSIGQMFETTMRGAKRMAILGDTAWDFGSVGOVLNSLGKMVHQIFGSVYTTMFG 92. 16F12 MTVFFTLMMLVAPSYGMRCVGVGNGDFVEGLSGATWVDVVLEHCSCVTTMACYKPTLD FELIKTIAKEVALLRTYCTEASISNITTDSRCPTQGEAILPEEQDQNYVCKHTYVDRG WGNGCGLFGKGSVVTCAKFQCLEPIEGKVVQPENLEYTIVITPHTGDQHQVGNDTQGV TVEITPQASTVEAILPEYGTLGLECSPRTGLDFNEMTLLTMKNKAWMVHRQWFFDLPL PWTSGATTEVHWNHKERMVTFKTAHAKKQDVVVLGSQEGTMHTALTGATEIQMSSGTT TIFAGHLKCKVKMDKLQLKGMSYTMCTGKFQIVKEIAETQHGTIVIRVQYEGDGAPCK IPLEIMDLEKRHVLGRLITVNPIVIDKEKPVNIEAEPPFGESYIVVGVODSALKINWY KKGSSIGKMFESTYRGAKRMAILGETAWDFGSVGGVFTSICKALHQVFGAIYGA 93. 16G11 MGIIFILLMLVTPSMAMRCVGIGNRDFVEGLSGATWVDVVLEHOGCVTTMAKNKPTLD IELLKTEVTNPAVLRKLCIEASISNTTTDSRCPTQGEATLVEEQDTNFVCRRTFVDRG WONOCGLFGKGSVVTCAKFQCLEPIEGKVVQHENLKYTVIITVHTGDQHQVGNDTQGV TVEITPQASTVEAILPEYGTLGLECSPRTGLDFNEMILLTMKNKAWLVHKQWFLDLPL PWTAGADTSEVHWNHKERMVTFKTAHAKKQDVVVLGSQEGAMHTALTGATEIQMSSGN LLFTGHLKCRLRMDKLQLKGMSYVMCTGKFQIVKEIAETQHGTIVIRVQYEGDGAPCK IPFSTEDGQGKAHNGRLITANPIVIDKEKPVNIELEPPFGDSYIVVGAGDKALKINWY KKGSSIGKMFEATARGARRMAILGDTAWDFGSLGCLLTSLGKAVHQVFGSVYTTMFG 94. 17A12 MTVFFILMMLVTPSMTMRCVGVGNRDFVEGLSGATWVDVVLEHGSCVTTMAKNKPTLD FELIKTEATQPATLRKYCIEAKTTNTTTDSRCPTQGEAILPEEQDQNYVCKHTYVDRG WGNGCGLFGKGSLITCAKFKCLEPIEGKVVQPENLEYTTVITPHTGDQHQVGNDTQGV TVEITPQASTVEAILPEYCTLGLECSPRTGLDFNEMILLTMKKKSWLVHKQWFLDLPL PWTSGATTETPTWNRKELLVTFKNAHAKRQDVTVLGSQEGAMHSALTGATEIQTSGTT TIFAGHLKCRVRMDKLQLKGMSYVMCTGSFKLEKEVAETQHGTVLVQVKYEGTDAPCK IPVSSQDEKGVTQNGRLITANPTVTEKDSPVNIEAEPPFGDSYIIIGVEPGQLKLHWF KKGSSIGQMFETTMRGAKRMAILGDTAWDLGSVGGVFTSIGKALHQVFGAIYGA 95. 17D5 MTVFFILLMLVTPSMTMRCIGISNRDFVEGVSGATWVDVVLEHGGCVTTMARNKPTLD IELIKTEATQPATLRKYCIEASISNITTATRCPTQGEAILPEEQDQQYTCRRDVVDRG WGNGCCGLFGKGSVVTCAKFQCLEIEGKVVQHENLKYTVIITVHTGDQHQVGNDTQGV TVEITPQAPTSEIQLTDYGALTLDCSPRTGLDFNRVVLLTMKKKTWLVHKQWFLDLPL PWTSGATTETPTWNRKELLVTFKNAHAKKQEVVVLGSQEGAMHTALTGATEVDSGDGN LLFTGHLKCRLKMDKLQLKGMSYTMCTGKFQIVKEIAETQHGTILIKVEYKGEDAPCK IPFSTEDEKGVTQNGRLITANPIVTEKDSPVNIEAEPPFGESYIVVGAGDKALKINWY KKGSSIGKMFESTYRGAKRMAILGDTAWDFGSLGGVFTSIGKALHQVFGAIYGA 96. 17D11 MTVFFILMMLVAPSYGMRCVGTGNRDFVEGLSGATWVDVVLEHGGCVTTMAQGKPTLD IELIKTEATQPATLRKYCIEAKLTNTTTESRCPTQGEPYLKEEQDQNYVCKHTYVDRG WGNGCGLFGKGSVVTCAKFQCLEPIEGKVVQHENLKYTVIITVHTGDQHQVGNDTQGV TVEITPQASTVEAILPEYGTLGLEGSPRTGLDFNEMILLTMKKKSWLVHKQWFLDLPL PWTAGASTSQETWNRKELLVTFKNAHAKKQEVVVLGSQEGAMHTALTGATEVDSGDGN LLFTGHLKCRLKNDKLQLKGMSYTMCTGKFQIVKEIAETQHGTILIKVEYKGTDAPCK IPFSSQDEKGVTQNGRLITANPIVTEKDSPVNIEAEPPFGESYIVVGAGDKALKINWY KKGSSIGKMFESTYRGAKRMAILGETAWDFGSVGGVFTSIGKALHQVFGAIYGA 97. 17F5 MTVFFILMMLVAPSYGMRCVGVGNRDFVEGLSGATWVDVVLEHGSCVTTMAKNKPTLD IELLKTEVTNPATLRKYCIEAKITNITTDSRCPTQGEAILPEEQDQNYVCKHTYVDRG WGNGCGLFGKGSVVTCAKFQCLEPIEGKVVQHENLKYTVIITVHTGDQHQVGNDTQGV TVEITPQASTVEAILPEYGTLGLECSPRTGLDFNRVVLLTMKKKSWLVHKQWFLDLPL PWTAGADTSEVHWNHKERMVTFKTAHAKKQDVVVLGSQEGAMHSALAGATEIQMSSGN LLFTGHLKCRLRMDKLQLKGMSYVMCTGKFQIVKEIAETQHGTIVIRVQYEGDDAPCK IPFSTEDEKGVTQNGRLITANPIVTEKDSPVNIEAEPPFGESYIVVGVGDSALTLHWF RKGSSIGQMFETTMRGAKRMAILGDTAWDFGSLGGVFTSLGKAVHQIFGSVYTTMFG 98. 17F11 MVVIFILLMLVTPSMTMRCIGISNRDFVEGVSGATWVDVVLEHGGCVTTMAKNKPTLD FELIKTTAKEVALLRTYCIEASISNITTATRCPTQGEAILPEEQDQNYVCKHTYTDRG WGNGCGLFGKGSLITCAKFQCLEPIEGKVVQHENLKYTVIITVHTGDQHQVGNDTQGV TVEITPQASTVEAILPEYGTLGLECSRTGLDFNEMILLTMKNKAWMVIIKQWLLDLPL PWTAGADTSEVHWNKERMTVTFKVPHAKKQDVVVLGSQEGAMHTALTGATEIQMSSGN LLFTGHLKCRLRMDKLQLKGMSYSMCTGKFQIVKEIAETQHGTILIKVEYKGEDAPCK IPFSTEDEKGVTHNGRLITANPIVIDKEKPVNIELEPPFGDSYIVIGVGDSALTLHWF RKGSSIGKMFESTYRGAKRMAILGETAWDFGSVGGLLTSLGKAVHQVFGSVYTTMFG 99. 17G5 MTVFFILMMLVAPSMANRCVGIGNRDFVEGLSGATWVDVVLEHGGCVTTMAKNKPTLD FELIKTTAKEVALLRTYCIEASISNITTATRCPTQGEAILKEEQDQNYVCKHTYVDRG WGNGCGLFGKGSVVTCAKFQCLEPIEGKVVQHENLKYTVIITVHTGDQHQVGNDTQGV TVEITPQASTVEAILPEYGTLGLECSPRTGLDFNEMILLTMKNKAWMVHRQWFFDLPL PWTAGADTSEVHWNHKERVTFKTAHAKKQEVVVLGSQEGAIIHSALAGATEVDSGDGN HMFAGHLKCKVRMDKLTLKGVSYVMCTGSFKLEKEIAETQHGTILIKVEYKGEDAPCK IPFSTEDGQGKAHNGRLITANPIVIDKEKPVNIELEPPFGDSYIVVGAGDKALKINWY KKGSSIGKMFESTYRGAKRMAILGETAWDFGSVGGVFTSIGKALHQVFGAIYGA 100. 17H3 MVVIFILLMLVTPSMTMRCIGISNRDFVEGVSGATWVDVVLEHGSCVTTMAKNKPTLD IELLKTEVTNPATLRKYCIEAKITNITTATRPTQGEAILPEEQDQNYVCKIITYVDRG WGNGCGLFGKGSLVTCAIKFQCLEPIEGKVVQHENLKYTIITVITGDQHQVGNDTQGV TVEITPQASTVEAILPEYGTLGLECSPRTGLDFNEMILLTMKKKTWMVHRQWFFDLPL PWTAGADTSEVHWNIKERMVTFKTAHAKRQDVTVLGSQEGAMHSALTGATEVDSGDGN LLFTGHLKCRLKMDKLQLKGMSYTMCTGKFQIVKEIAETQHGTILIKVEYKGEDAPCK IPFSTEDGQGKAHNGRLITANPAVTDKEKPVNIELEPPFGDSYIVVGAGDKALKLSWF KKGSSIGQMFESTYRGAKRMAILGETAWDLGSIGGVFTSVGK 101. 17H10 MVVIFILLMLVTPSMTMPCIGISNRDFVEGVSGGSWVDVVLEHGSCVTTMAKNKPTLD TELLKTEVTNPATLRKYCIEAKITNTTTDSRCPTQGEAIMPEEQDQNYVCKHTYVDRG WGNGCGLFGKGSLVTCAKFTCKKNMEGKVVQHENLKYTVIITVHTGDQHQVGNDTQGV TVEITPQASTVEAILPEYGTLGLECSPRTGLDFNEMILLTMKNKAWMVHRQWFLDLPL PWTAGADTSEVHWNHKERMVTFKTAHAKKQDVVVLGSQEGAMHTALTGATEVDSGDGN LLFTGHLKCRLKMDKLQLKGMSYVMCTGKFQIVKEIAETQHGTIVIRVQYEGDGAPCK IPFSTEDGQGKAHNGRLITANPTVIDKEKPVNIELEPPFGDSYIVVGAGDKALKINWY KKGSSIGKNFEATARGARRMAILGDTAWDFGSVGGVLTSLGKMVHQIFGSVYTTMFG 102. 17H12 MVVTFILLMLVTPSMTMRCVGVGNRDFVEGLSGATWVDVVLEHGSCVTTNAQGKPTLD IELLKTEVTNPATLRKYCIEAKLTNTTTESRCPTQGEPYLKEEQDQQYICRRDVVDRG WGNGCGLFGKGSLITCAKFKCVTKLEGKIVQYENLKYSVTVTVITGDQHQVGNETTEH GTTATITPQAPTSEIQLTDYGALTLDCSPRTGLDFNRVVLLTMKKKSWLVIKQWFLDL PWTAGADTSEVHWNHKERMVTFKTAHAKKQDVVLGSQEGAJMIIITALTGATEVDSGD GNLLFTGHLKCRLKMDKLQLKGMSYSMCTGKFQIVKEIAETQHGTIVIKVEYKGEDAP CKIPFSTEDGQGKAHNGRLITANPIVIDKEKPVNIEAEPPFGESYIVVGAGEKALKLS WFKKGSSIGQHFETTHRGAKRMATLGDTAWDFGSVGGLLTSLGKMVHQIFGSVYTTMF G 103. 18A9 MTVFFILLMLVTPSMAIRCVGIGNRDFVEGLSGATWVDVVLEHGSCVTTMAKNKPTLD IELLKTEVTNPATLRKYCIEAKITNITTATRCPTQGEAILPEEQDQNYVCKHSMVDRG WGNGCGLFGKGSVVTCAKFQCLEPIEGKVVQHENLKYTVITTVHTGDQHQVGNDTQGV TVEITPQASTVEAILPEYGTLGLECSPRTGLDFNEMILLTMKNKAWMVHKQWFLDLPL PWTAGADTSEVHWNHKERMVTFKVPHAKRQDVTVLGSQEGAMHSALAGATEIQMSSGN LLFTGHLKCRLPMDKLTLKGVSYVMCTGKFQIVKETIAETQHGTIVMVQYEGTDAPCK IPFSSQDEKGVTQNGRLITANPIVIDKEKPVNIELEPPFGDSYIVVGAGDKALTLIWF RKGSSIGQMFETTMRGAKPMAILGDTAWDFGSVGGLLTSLGKMVHQIFGSVFTSVGK 104. 18B7 MTVFFILLMLVTPSMAMRCIGISNRDFVEGLSGATWVDVVLEHGSCVTTMAKNKPTLD FELIKTTAKEVALLRTYCIEASISIITTATRCPTQGEPYLKEEQDQQYICRRDVVDRG WGNGCGLFGKGSVVTCAKFQCLEPIEGKVVQHENLKYTVIITVITGDQHQVGNDTQGV TVEITPQASTVEAILPEYGTLGLECSPRTGLDFNEMTLLTMKNKAWMVIKQWFLDLPL PWTAGADTSEVHWNHKERMVTFKTAHAKKQDVVVLGSQEGAMHTALTGATEVDSGDGN LLFTGHLKCRLPMDKLQLKGMSYSMCTGKFQIVKEIAETQHGTILIKVEYKGEDAPCK IPFSTEDEKGVTQNGRLITANPIVIDKEKPVNIEAEPPFGDSYIVVGAGDKALKTNWY KKGSSIGKMFESTYRGAKRMAILGETAWDFGSVGGLLTSLGKAVHQVFGSVYTTMFG 105. 18D7 MVVIFILLMLVTPSMTMRCIGISNRDFVEGVSGATWVDVVLEHGSCVTTMAKNKPTLD FELLKTEVTNPAVLRKLCIEASISNITTATRCPTQGEAILPEEQDQNYVCKHTYVDRG WGNGCGLFGKGSLITCAKFKCVTKLEGKIVQYENLKYSVIVTVITGDQHQVGNDTQGV TVEITPQASTVEAILPEYGTLGLECSPRTGLDFNEMILLTMKNKAWMVIRQWFFDLPL PWTAGADTSEVHWNHKEPIVTFKTAAKKQDVVVLGSQEGAI4ITALTGATEIQMSSGT TTIFAGHLKCRLKMDKLQLKGMSYSMCTGKFQITKEIAETQHGTILIKVKYEGTDAPC KIPFSSQDEKGVTQNGRLITANPIVTEKDSPVNIEAEPPFGESYIVVGAGEKALKLSW FKKGSSIGQMFESTYRGAKRMAILGETAWDFGSVGGLLTSLGKAVHQVFGSVYTTMFG 106. 18E9 MTVFFILMMLVAPSYGMRCVGVGNRDFVEGLSGATWVDVVLEHGGCVTTMAKNKPTLD FELIKTEATQPATLRKYCIEAKLTNTTTATRCPTQGEPYLKEEQDQNYVCKITYVDRG WGNGCGLFGKGSVVTCAKFQLCEPIEGKVVQHENLKYSVIVTVHTGDQHQVGNDTQGV TVEITPQASTVEAILPEYGTLGLECSPRTGLDFNEMILLTMKKKSWLVHKQWFLDLPL PWTSGASTSQETWNRKELLVTFKNAHAKRQDVTTLGSQEGAMHTALTGATEVDSGDGN LLFTGHLKCRLKMDKLQLKGMSYTMCTGKFQIVKEIAETQHGTILIKVEYKGEDAPCK IPFSSQDEKGVTQNGRLITANPIVIDKEKPVNIELEPPFGDSYIVVGAGDKALKINWY KKGSSIGKMFESTYRGAKRMAILGDTAWDFGSVGGVLNSLGKMVHQIFGSVYTTMFG 107. 18E10 MTVFFILLMLVTPSMANRCVGVGNRDFVEGLSGATWVDVVLEHGGCVTTMAKNKPTLD IELQKTEATQLATLRKYCIEAKLTNTTTESCRPTQGEAILPEEQDQNYVCKHTYVDRG WGNGCGLFGKGSLITCAKFKCVTKLEGKIVQYENLKYSVIVTVHTGDQHQVGNETTEH GTIATITPQAPTSEIQLTDYGALTLDCSPRTGLDFNRVVLLTMKKKSWLVHKQWFLDL PLPWTAGADTSEVHWNHKERMVTFKTAHAKKQEVVVLGSQEGAMHTALTGATEVDSGD GNLLFTGHLKCRLRMDKLQLKGMSYSMCTGKFQIVKETAETQHCTIVIRVQYEGDGSP CKIPLEIMDLEKRRVLGRLITANPIVIDKEKPVNTELEPPFGDSYIVVGAGDKALKIN WYKKGSSIGQMFETTMRGAKRMAILGDTAWDFGSVGGLLTSLGKAVHQVFGSVYTTMF G 108. 18E11 MTVFFILMMLVAPSYGMRCVGVGNRDFVEGLSGATWVDVVLEHGGCVTTMAKNKPTLD FELIKTTVKEVALLRTYCIEASISNITTATRCPTQGEPYLKEEQDQQYTCRRDVVDRG WGNGCGLFGKGSLITCAKFKCVTKLEGNIVQPENLEYTTVITPHTGDQHQVGNDTQGV TVEITPQASTVEAILPEYGTLGLECSPRTGLDFNEMILLTMKKKSWLVHKQWFLDLPL PWTAGADTSEVHWNHKEPMVTFKTAHAKKQDVVVLGSQEGAMHTALTGATEVDSGDGN LLFTGHLKCRLKNDKLQLKGMSYSMCTGKFQIVKEIAETQHGTILIKVEYKGEDAPCK IPFSTEDEKGKAHNCRLITANPIVIDKEKPVNIEAEPPFGESYIVVGAGDKALKLSWF KKGSSICQMFESTYRGAKRMAILGETAWDFCSVGGVFTSIGKALHQVFGAIYGA 109. 18H2 MTVFFILMMLVAPSYGMRCVGVGNRDFVEGVSGGAWVDLVLEHGSCVTTMAKNKPTLD IELLKTEVTNPATLRKYCIEAKITNITTDSRCPTQGEAILPEEQDQNYVCKHTYVDRC WGNGCCLFGKGSVVTCAKFQCLEPTECKVVQHENLKYTVIITVHTGDQHQVGNDTSNH GVTVEITPQASTVEAILPEYCTLGLECSPRTGLDFNRVVLLTMKKKSWLVHKQWFLDL PLPWTACADTSEVHWNIIKERIVTFKVPHAKKQDVVVLGSQEGAMTALAGATEIQMSS GNLLFTCHLKCRLKTMDKLQLKGMSYSMCTGKFQIVKEIAETQHGTIVMVQYECDGAP CKIPFSTEDGQCKAHNGRLITANPVVTKKEEPVNIEAEPPFGESNIVTGICDKALKIN WYKKGSSTGKNFETTMRGAKRMAILGDTAWDFGSVCCLLTSLGKIVHQIFCSVYTTMF G 110. 18H6 MVVIFTLLMLVTPSMTMRCVCVGNRDFVEGLSCATTVDVVLEHGGGVTTMAKNKPTLD FELIKTTAKEVALLRTYCTEASISNITTATRCPTQGEAILPEEQDQNYVCKHTYVDRG WGNGCCLFGKGSLVTCAKFTCKKNMEGNIVQPENLEYTIVITPHTCDQHQVGNDRQGV TVEITPQASTVEAILPEYGTLGLECSPRTGLDFNRVVLLTMKKKSWLVHKQWFLDLPL PWTAGADTSEVHWNHKERMVTFKNAAKRQDVTVSLCSQEGAMHSALAGATETQMSSGN LLFTGHLKCRLKMDKLQLKGVSYVMCTCSFKLEKEVAETQHGTVLVQVKYECTDAPCK IPFSTEDGQGKAHNCRLITVNPIVIDKEKPVNTELEPPFGESYIVVGAGEKALKLSWF KKGSSICKNFEATARGAKRMAILGETAWDFCSVGGLLTSLGKAVIQVFGATYGA 111. 21C1 MALIFILLTAVAPSMTMRCICISNRDFVEGVSGGAWVDLVLEHGGCVTTMAQGKPTLD PRM15 FELIKTTAKEVALLRTYCIEASISNITTATRCPTQGEPYLKEEQDQQYICRRDVVDRG ETRUNC WGNGCGLFGKGGVVTCAKFKCVTKLEGKIVQYENLEYTVVTVIINGDTHAVGNDTSNH (“E- GVTATITPRSPSVEVKLPDYGELTLDCEPRSGIDFNEMILMKMKKKTWLVHKQWFLDL truncated PLPWTAGADTSEVHWNHKERMVTFKVPHAKRQDVTVLGSQEGAMHSALACATEVDSGD ”or GNIIMFAGHLKCKTRMDKLQLKCMSYSMCTGKFQLVKEIAETQHGTIVMVQYECDGSP “tE”) CKIPLEIMDLEKRHVLCRLITVNPIVIDKEKPVNIEAEPPFCDSYIIIGVGDSALTLH WERKOSSIGKMFESTYRCARRMAILGETAWDFGSVGGLLTSLGKAVHQVFGSVYTTMF G 112. 23C12 MALIFILLMLVTPSMTMRCVGVGNRDFVECLSCATWTDVVLEHCGCVTTMAKNKPTLD PRM15 IELQKTEATQLATLRKLCIEGKITNITTDSRCPTQGEAILPEEQDQNYVCKHTYVDRG ETRUNC WGNGCGLFCKGSLVTCAKFQCLEPTEGKVVQHENLKYTVIITVHTGDQHQVGNDTQGV TVEITPQASTVEAILPEYGTLGLECSPRTCLDFNEMILLTNKNKAWMVHRQWFFDLPL PWTSGATTETPTWNRKELLVTFKNAAKKQEVVVLTGSQEGAMITALTCTTEIQNSCGT SIFAGHLKCRLKMDKLELKGMSYSMCTGKFQIVTKEIAETQHGTIVMVQYEGDGSPCK IPLEIMDLEKPJIVLGRLITVNPIVTEKDSPVNIEAEPPGDSYIIIGVEPGQLKLIWF RKGSSIGQMFETTMRGAKRNAILGETAWIJFGSVGCLLTSLGKAVHQVFGSVYTTMFG 113. 23D5 MGIIFILLMLVTPSMAIRCVGIGNRDFVEGLSGATWVDVVLEHGSCVTTMAKNKpTLD PRM15 IELLKTEVTNPAVLRKLCTEAKISNTTTDSRCPTQGEATLVEEQDTNFVCRRTFVDRC ETRUNC WGNGCGLFGKGSLVTCAKFQCLEPIEGKVVQHENLKYTVIITVHTGDQHQVGNDTQGV TVEITPQAPTSEIQLTDYGALTLDCSPRTGLDFNRVVLLTIKKSWLVHIKQWFLDLPL PWTSGASTSQETWNRQDLLVTFKTAHAKKQEVVTLCSQEGAMHTALTCTTEIQTSGTT TIFAGHLKCRLKMDKLTLKGMSYVMCTGSFKLEKEVAETQHGTTLVQVKYEGTGAPCK VPIEIRDVNKEKVVGRITSSTPLAENTNSVTNIELEPPFGDSYIVIGVGDSALTLHWF RKGSSIGKMFESTARGARRMAILGDTAWDFGSIGGVPTSVGK 114. 23F5 MTVFFVLMMLVAPSMAIIMCVGIGNRDFEGLSGGAWVDLVLEHGGCVTTMAKNKPTLD PRM15 FELIKTEATQPATLRKYCIEAKLTNTTTESRCPTQCEAILPEEQDQNYVCKHTYVDRG ETRIUNC WGNGCGLFGKGSLVTCAKFQGLEPIEGKVVHENLKYTVIITVIITGDQHQVGNDTQGV TVEITPQASTVEAILPEYGTLGLECSPRTGLDFNEMTLLTMKNKTWLVHKQWFLDLPL PWTAGADTREVHWNHKERMVTFKVPHAKRQDVTVLGSQEGAMHSALAGATEVDSGDGN HMFAGHLKCRLKMIDKLELKGMSYSMCTGKFRIVKETAETQHGTIVMVQYEGDGSPCK IPLEIMDLEKRHVLCRLITVNPIVIDKEKPVNIEAEPPFGESYIVVGAGEKALKLSWF KKGSSIGKMFEATARGARRMAILGDTAWDFGSIGGVFTSVGK 115. 23G3 MGIIFILLMLVTPSMAMRCVGIGNRDFVEGLSGATWVDVVLEHGGCVTTMAKNKPTLD PRM15 IELQKTEATQLATLRKLCIEGKITNITTDSRCPTQGEAILPEEQDQNYVCKHTYVDRG ETRUNC WGNGCGLFGKCSLVTCAKFQCLEPIEGKVVQHENLKYTVIITVHTGDQHQVGNDTQGV TVEITPQASTVEALPEYGTLGLECSPRTGLDFNEMILLTNIKNKAWMVHRQWFFDLPL PWTSGATTETPTWNRKELLVTFKNAHAKKQEVVVLGSQEGAMHTALTGATEIQTSGTT TIFAGHLKCRLKNDKLTLKGVSYVMCTGSFKLEKEVAETQHGTVLVQVKYEGTDAPCK IPESSQDEKGVTQNGRLITANPIVIDKEKPVNIEAEPPFGESNIVIGIGDKALKINWF KKGSSIGKNFEATARGARRMAILGDTAWDFGSIGGVFTSIGKALHQVFGAIYGA 116. 23H7 MGIIFILLMLVTPSMTMMCVGIGNRDFVEGLSGATWVDVVLEHGSCVTTMAKNKPTLD PRM15 IELLKTEVTNPAVLRKLCTEAKISNTTTDSRCPTQGEATLVEEQDTNFVCRRTFVDRG ETRUNC WGNGCGLFGKGSLITCAKFKCLEPIEGKVVQHENLKYTVIITVHTGDQHQVGNDTQGV TVEITPQASTVEAILPEYGTLGLECSPRTCLDFNRVVLLTMKKKSWLTIKQWFLDLPL PWTSGASTSQETWNRQDLLVTFKTAHAKKQEVVVLGSQEGAMHTALTGATEIQTSGTT TIFTGHLKCRLKMDKLTLKGVSYVTCTGSFKLEKEVAETQHGTVLVQTKYEGTDAPCK IPFSSQDEKGVTQNGRLITANPVVTKKEEPVNIEAEPPFGESNIVIGIGDKALKINWY KKGSSICKMFEATARGARRMAILGDTAWDFGSIGGVFTSVGK 117. DEN-1 MRSVTMILMLLPTALAFHLTTRGGEPTLIVSKQERGKSLLFKTSAGVNMCTLIAMDLG REST OF ELCEDTMTYKCPRMTEAEPDDVDCWCNATDTWVTYGTCSQTGEHRRDKRSVALDPIVG C15/PRM LGLETRTETWMSSEGAWKQIQKVETWALRHPGFTVIALFLAHAIGTSITQK 118. DEN-2 MNRRRRTVGVIIMLIPTAMAFHLTTRNGEPHMIVGRQEKGKSLLFKTEDCVNMCTLMA REST OF IDLGELCEDTITYKCPLLRQNEPEDIDCWCNSTSTWVTYGTCTTTGEHRMEKRSVALV C15/PRM PHVGMGLETRTETWMSSECAWKHAQRIETWILRHPGFIIMAAILAYTIGTTHFQR 119. DEN-3 MKTSLCLMMMLPATLAFHLTSRDGEPRMIVGKNERGKSLLFKTASGINMCTLIAMDLG REST OF EMCDDTVTYKCPHITEVEPEDIDCWCNLTSTWVTYGTCNQAGERRRDKRSVALAPRIG C15/PRM MGLDTRTQTWMSAEGAWRQVEKVETWALRHPGFTILALFLAHYIGTSLTQK 120. DEN-4 MRSTITLLCLIPTVMAFSLSTRDCEPLMIVAKIERGRPLLFKTTEGINKCTLIAMDLG REST OF EMCEDTVTYKCPLLVNTEPEDIDCWCNLTSTWVMYGTCTQSGERRREKRSVALTPHSG C15/PRM MGLETRAETWMSSEGAWKHAQRVESWILRNPRFALLAGFMAYMIGQTGIQR 121. 21C1 REST MNRRRRTVGVIIMLIPTAMAFHLTTRNGEPHMIVGRQEKGKSLLFKTEDGVNMCTLMA OF IDLGELCEDTVTYKCPLLVNTEPEDIDCWCNLTSAWVTYGTCNQAGEHRRDKRSVALA C15/PRM PHVGMGLETRTETWMSSEGAWKHAQRIETWILRHPGFIIMAAILAYTIGTTHFQR 122. 23C12 MRSVTMILMLLPTALAFHLTTRGGEPTLIVSKQERGKSLLFKTASGINMCTLIAMDLG REST OF EMCDDTVTYKCPHITEVEPEDIDCWCNLTSTWVTYGTCNQAGEHRRDKRSVALVPHVG C15/PRM MGLETRTETWMSSEGAWKIIAQRIETWILRHPGFIIMAAILAYTIGTTHFQR 123. 23D5 REST MKTSLCLMMMLPATLAFHLTSRDGEPRMIVGKNERGKSLLFKTEDGVNNCTLIAMDLG OF ELCEDTMTYKCPRMTEVEPEDIDCWCNLTSTWVTYGTCNQAGEHRRDKRSVALAPHVG C15/PRM MGLDTRTQTWMSAEGAWRQVEKVETWALRHPGFTILALFLAIIYIGTSITQK 124. 23F5 REST MRSAITLLCLIPTVMAFSLSTRDGEPLMIVAKHERGRPLLFKTTEGINKCTLIAMDLG OF EMCEDTVTYKCPLLVNTEPEDIDCWCNLTSTWVMYGTCTQSGERRREKRSVALTPHSG C15/PRM MGLDTRTQTWMSAEGAWKHAQRVESWILRNPRFALLAGFMAYMIGQTGIQR 125. 23G3 REST MKTSLCLMMMLPATLFHLTTRNGEPHMIVGRQEKGKSLLFKTSAGVNMCTLIAIADLG OF ELCEDTMTYKCPRMTEAEPEDIDCWCNLTSTWVTYGTCTQSGERRREKRSVALTPHSG C15/PRM MGLETRAETWMSSEGAWKQIQKVETWALRHPGFTVIALFLAHAIGTSITQK 126 23H7 REST MKTSLCLDIYIMLPATLAFHLTSRDGEPLMVAKHERGRPLLKTTEGINKCTLIAMDLG OF EMCEDTVTYKCPLLRQNEPDDVDCWCNATDTWVTYGTCSQTGEHRRDKRSVALDPHVG C15/PRM LGLETRTETWMSSEGAWKQIQKVETWALRHPGFTVIALFLAHAIGTSITQK 127 DEN-1 LIHQTFGTAYGVLFSGVSWTMKIGIGTLLTWLGLNSRSTSLSMTCIAVGMVTLYLGVM REST OF VQA ENV 128. DEN-2 AFSGVSWTMKILIGVIITWIGMNSRSTSLSVSLVLVGVITLYLGANVQA REST OF ENV 129. DEN-3 ALFSGVSWIMKIGIGVLLTWIGLNSKNTSMSFSCIAIGIITLYLGVVVQA REST OF ENV 130. DEN-4 GVSWMVRTLIGFLVLWIGTNSRNTSMAMSCIAVGGITLFLGFTVHA REST OF ENV 131. 21C1 REST GVSWMVRTLIGFLVLWIGTNSPNTSMAMSCTAVGGITLFLGAIVQA OF ENV 132. 23C2 GVSWMVRTLIGFLVLWIGTNSRNTSMAMSCTAVGGITLFLGFTVHA REST OF ENV 133. 23D5 LTHQIFGTAYGVLFSGVSWTMKTGIGILLTWLGLNSRSTSLSMTCIAVGMVTLYLGVV of ENV VQA 134. 23D5 rest LIHQIFGTAYGVLFSGVSWTNKIGIGILLTWLGLNSRSTSLSMTCIAVGMVTLYLGVM of ENV VQA 135. 23G3 rest AFSGVSWTMKILTGVIITWIGMNSRSTSLSVSLVLVGVITLYLGAMVQA of ENV 136. 23H7 REST LIHQIFGTAYGVLFSGVSWTMKIGIGILLTWLGLNSRSTSLSMTCTAVGMVTLYLGAM OF ENV VQA 137. SEQ G V S W X₁ X₂ X₃I X₄ I G X₅ X₆ X₇ X₈ X₉ X₁₀ N S X₁₁ X₁₂ T PATTERN S X₁₃ X₁₄ X₁₅ X₁₆ X₁₇ X₁₈ X₁₉ X₂₀ G X₂₁ X₂₂ T L X_(23 L G X) ₂₄ X₂₅ V X₂₆ REST OF A, ENV wherein X₁ is a M, I, or T residue, X₂ is a V or M residue, X₃ is a K or R residue, X₄ is a L or G residue, X₅ is a V, I, or F residue, X₆ is a I or L residue, X₇ is an I, L, or V residue, X₈ is a T or L residue, X₉ is an I or L residue, X₁₀ is a M, L, or T residue, is a K or R residue, X₂₂ is a S or N residue, X₁₃ is a L or M residue, X₄ is a S or A residue, X₁₅ is a V, M, or F residue, X₁₆ is a S or T residue, X₁₇ is a L or C residue, X₁₈ is a V or I residue, X₁₉ is a L or A residue, X₂₀ is a V or I residue, X₂₂ is a V, M, I, or G residue, X₂₂ is a V or I residue, X₂₃ is a Y or F residue, X₂₄ is an A, V, or F residue, X₂₅ is a M, V, or T residue, and X₂₆ is a Q or H residue, 138. SEQ X₁ X₂ X₃ X₄ X₅ X₆ X₇ X₈ X₉ X₁₀ X₁₁ P X₁₂ X₁₃ X₁₄ A F X₁₅ L X₁₆ X₁₇ PATTERN R X₁₈ G E P X₁₉ X₂₀ I V X_(21 X) ₂₂ X₂₃ E X₂₄ G X₂₅ X_(26 L L F K T X) ₂₇ REST OF X₂₈ G X₂₉ X₃₀ C T L X₃₁ X₃₂ D L G E X₃₃ C X ₃₄ D T X₃₅ T Y K C15/PRN CPX₃₆ X₃₇ X₃₈ X₃₉ E P X₄₀ E P X₄₁ D X₄₂ D C W C N X₄₃ T X₄₄ X₄₅ M V X₄₆ Y G T C X₄₇ X₄₈ X₄₉ G E X₅₀ R R X₅₁ K R S V A L X₅₂ P H X₅₃ G X₅₄ C L X₅₅ T R X₅₆ X₅₇ T W M S X₅₈ E G A W X₅₉ X₆₀ X₆₁ X₆₂X₆₃ X₆₄ E X₆₅ W X₆₆ L R X₆₇ P X₆₈ F X₆₉ X₇₀ X₇₁ A X₇₂ X₇₃ X₇₄ A X₇₅ X₇₆ I G X₇₇ X₇₈ X₇₉ X₈₀ Q X_(81,) wherein X₁ is a M or R residue, X₂ is a R or K residue, X₃ is a S or T residue, X₄ is a V, S, T, or A residue, X₅ is a T, L, G, or I residue, X₆ is a M, C, V, or T residue, X₇ is an I or L residue, X₈ is a L, M, or I residue, X₉ is a M or C residue, X₁₀ is a L or M residue, X₁₁ is a L or I residue, X₁₂ is a T or A residue, X₁₃ is an A, T, or V residue, X₁₄ is a L or N residue, X₁₅ is a H or S residue, X₁₆ is a T or S residue, X₁₇ is a T or S residue, X₁₈ is a G, D, or N residue, X₁₉ is a T, L, R, or H residue, X₂₀ is a L or M residue, X₂₁ is a S, A, or G residue, X₂₂ is a K or R residue, X₂₃ is a Q, H, or N residue, X₂₄ is a R or K residue, X₂₅ is a K or R residue, X₂₆ is a S or P residue, X₂₇ is a S, T, A, or E residue, X₂₈ is an A, E, S, or D residue, X₂₉ is a V or I residue, X₃₀ is a N or K residue, X₃₁ is an I or N residue, X₃₂ is a M or I residue, X₃₃ is a L or N residue, X₃₄ is an E or D residue, X₃₅ is a N, V, or I residue, X₃₆ is a R, L, or H residue, X₃₇ is a M, L, or I residue, X₃₈ is a T, R, or V residue, X₃₉ is an E, Q, or N residue, X₄₀ is an A, N, V, or T residue, X₄₁ is a D or E residue, X₄₂ is a V or I residue, X₄₃ is an A, L, or S residue, X₄₄ is a D or S residue, X₄₅ is a T or A residue, X₄₆ is a T or M residue, X₄₇, is a S, N, or T residue X₄₈ is a Q or T residue, X₄₉ is a T, A, or S residue, X₅₀ is a H or R residue, X₅₁ is a D or E residue, X₅₂ is a D, A, T, or V residue, X₅₃ is a V or S residue, X₅₄ is a L or M residue, X₅₅ is an E or D residue, X₅₆ is a T or A residue, X₅₇ is an E or Q residue, X₅₈ is a S or A residue, X₅₉ is a K or R residue, X₆₀ is a H or Q residue, X₆₁ is an I, V, or A residue, X₆₂ is a Q or E residue, X₆₃ is a K or R residue, X₆₄ is a V or I residue, X₆₅ is a T or S residue, X₆₆ is an A or I residue, X₆₇ is a H or N residue, X₁₆₈ is a C or R residue, X₁₆₉ is a T, I, or A residue, X₁₇₀ is a V, I, or L residue, X₇₁ is an I, L, or M residue, X₇₂ is a L, A, or C residue, X₇₃ is a F or I residue, X₇₄ is a L or M residue, X₇₅ is a H or Y residue, X₇₆ is an A, Y, T, or M residue, X₇₇ is a T or Q residue, X₇₈ is a T or S residue, X₇₉ is an I, L, H, or C residue, X₈₀ is a T, F, or I residue, X₈₁ and is a K or R residue, 139. 2G11-D4 MRSTITLLCLIPTVMAFSLSTRDGEPLMIVAKHERGRPLLFKTTEGINKCTLIAMDLG EMCEDTVTYKCPLLVNTEPEDIDCWCNLTSTWVMYGTCTQSGERPREKRSVALTPHSC MGLETRAETWNSSEGAWKHAQRTESWILRNPRFALLAGFMAYMIGQTGIQRTVFFVLN MLVAPSYGNRCVGVGNRDFVEGVSCATWVDVVLEHCGCVTTMAKNKPTLDFELIKTTA KEVALLRTYCIEASISNITTATRCPTQGEPYLKEEQDQQYICRRDVVDRGWGNGCGLF GKGSLITCAKFKCVTKLEGNIVQPENLEYTTVITPHTCDQHQVGNDTQGVTVEITPQA STVEAILPEYGTLGLECSPRTGLDFNEMILLTMKNKAWMVHRQWFFDLPLPWTSGATT ETPTWNRKELLVTFKNAHAKKQEVVVLGSQEGAMHSALAGATETQMSSCNLLFTGHLK CRLKMDKLQLKGMSYTMCTGKFQIVKETAETQHGTILIKVEYKGEDAPCKTPFSTEDE KGVRQNGRLITANPTVTEKDSPVNIEAEPPFGESYIVVGAGEKALKLSWFKKGSSIGQ MFESTYRCAKRMAILGETAWDFGSVGGLLTSLGKAVHQVFGSVYTTMFGGVSWNVRTL IGFLVLWTGTNSRNTSMAMSCTAVGGTTLFLGFTVHA 140. 5/21-D1 MRSVTMILMLLPTALAFHLTTRGGEPTLIVSKQERGKSLLFKTSAGVNMCTLIAMDLG ELCEDTMTYKCPPMTEAEPDDVDCWGNATDTWVTYGTCSQTGEHRRDKRSVALDPHVG LCLETRTETWMSSEGAWKQTQKVETWALRHPGFTVIALFLAHAIGTSITQKGIIFILL MLVTPSMAIRCVGIGNRDFVEGLSGATWVDVVLEHGSCVTTMAKNKPTLDIELLKTEV TNPATLRKYCIEAKITNITTDSRCPTQGEAILPEEQDQNYVCKHTYVDRGWGNGCGLF GKGSLVTCAKFTCKKNMEGNTVQPENLEYTIVITPHSGEEHAVGNDTGKHGKEIKTTP QSSTTEAELTGYGTVTMECSPRTCLDFNRVVLLTNKKKSWLVHKQWFLDLPLPWTAGA DTSEVHWNHKERMVTFKVPHAKRQDVTVLGSQEGAMHSALAGATEIQMSSGNLLFTGH LKCRLRMDKLQLKGMSYVMCTGKFQIVKEIAETQHGTTIVMVQYEGDGSPCKTPLEIM DLEKRIVLGRLITVNPIVIDKEKPVNIEAEPPFCESYIVVGVGDSALTLHWFRKGSST GQMFETTMRGAKRMAILGDTAWDFGSIGGVFTSVGKLTHQIFGTAYGVLFSGVSWTMK IGIGILLTWLGLNSRSTSLSMTCIAVGMVTLYLGVMVQA 141. 6E13-D4 MRSTITLLCLIPTVMAFSLSTRDGEPLMTVAKHERGRPLLFKTTEGINKCTLIAMDLG EMCEDTVTYKCPLLVNTEPEDTDCWCNLTSTWVMYGTCTQSGERRREKRSVALTPHSG MGLETPAETWMSSEGAWKHAQRVESWILPNPRFALLAGFMAYMIGQTGIQRTVFFVLM MLVAPSYGMRCVGVGNRDFVEGVSGATWVDVVLEHGSCVTTMAKNKPTLDIELLKTEV TNPAVLRKLCIEASISNTTTATRCPTQGEAILPEEQDQNYVCKHTYVDRGWGNGCGLF GKGSVVTCAKFQCLEPTEGKVVQHENLKYTVIITVHTGDQHQVGNDTQGVTTEITPQA STVEATLPEYGTLGLECSPRTGLDFNEMILLTMKNKAWMVHRQWFFDLPLPWTAGADT SEVHWNHKERMVTFKTAHAKKQDVVVLGSQEGAMITALTGATEIQMSSGTTTIFAGHL KCKVRADKLTLKGVSYVMCTGSFKLEKEVAETQHGTVLVQVKYEGTDAPCKIPFSSQD EKGVTQNGRLITANPTVIDKEKPVNIEAEPPFGDSYIIIGVEPGQLKLHWFKKGSSIG QMFETTMRGAKRMAILGDTAWDFGSVGGLLTSLGKMVHQIFGSVYTTMFGGVSWMVRI LIGFLVLWIGTNSRNTSMAMSCIAVGGITLFLGFTVIA 142. 21C1 MNRRRRTVGVIIMLIPTAMAFHLTTRNGEPHMIVGRQEKGKSLLFKTEDGVNICTLMA C15/full) IDLGELCEDTVTYKCPLLVNTEPEDTDCWCNLTSAWVTYGTCNQAGEHRRDKRSVALA prM/full PHVGMGLETRTETWMSSEGAWKHAQRIETWILRHPGFIIMAAILAYTIGTTHFQRALI E) FILLTAVAPSMTMMCIGISNRDFVEGVSGGAWVDLVLEHGGCVTTMAQGKPTLDFELI KTTAKEVALLRTYCIEASISNITTATRCPTQGEPYLKEEQDQQYICRRDVVDRGWGNG CGLFGKGGVVTCAKFKCVTKLEGKIVQYENLEYTVVVTVHNGDTHAVGNDTSNHGVTA TITPRSPSVEVKLPDYGELTLDCEPRSGIDFNEMILMKMKKKTWLVHKQWFLDLPLPW TAGADTSEVHWNHKERMVTFKVPHAKRQDVTVLGSQEGAMHSALAGATEVDSGDGNHM FAGHLKCKVRMDKLQLKGMSYSMCTGKTFQLVKEIAETQHGTIVMVQYEGDGSPCKIP LEIMDLEKRHVLGRLITVNPIVIDKEKPVNIEAEPPFGDSYIIIGVGDSALTLHWFRK GSSIGKMFESTYRGARPMAILGETAWDFGSVGGLLTSLGKAVHQVFGSVYTTMFGGVS WMVRILIGFLVLWIGTNSPNTSMAMSCIAVGGITLFLGAIVQA 143. 23C12 MRSVTMILMLLPTALAFHLTTRGGEPTLIVSKQERGKSLLFKTASGINMCTLTAMDLG (Cl5/full EMCDDTVTYKCPHITEVEPEDIDCWCNLTSTWVTYGTCNQAGEHPRDKRSVALVPHVG prM/full MGLETRTETWMSSEGAWKHAQRIETWILRHPGFIIMAAILAYTIGTTHFQPALIFILL E) MLVTPSMTMRCVGVGNRDFVEGLSGATWVDVVLEHGGCVTTMAKNKPTLDTELQKTEA TQLATLRKLCIEGKITNITTDSRCPTQGEAILPEEQDQNYVCKHTYVDRGWGNGCGLF GKGSLVTCAKFQCLEPIEGKVVQHENLKYTVIITVHTGDQHQVGNDTQGVTVEITPQA STVEAILPEYGTLGLECSPRTTGLDFNEMILLTMKNAWMVHRQWFFDLPLPWTSGATT ETPTWNRKELLVTFKNAHAKKQEVVVLGSQEGAMITALTGTTEIQNSGGTSIFAGHLK CRLKMIDKLELKGMSYSMCTGKFQIVKEIAETQHGTIVMVQYEGDGSPCKIPLEIMDL EKRHVLGRLITVNPIVTEKDSPVNIEAEPPFGDSYIIIGVEPGQLKLHWFRKGSSIGQ MFETTMRGAKRMAILGETAWDFGSVGGLLTSLGKAVHQVFGSVYTTMFGGVSWMVRIL IGFLVLWIGTNSRNTSMAMSCTAVGGTTLFLGFTVIA 144. 23D5 MKTSLCLMIVMLPATLAFHLTSRDGEPRMIVGKNERGKSLLFKTEDGVNCTLIAMDLG (C15/full) ELCEDTMTYKCPRMTEVEPEDIDCWCNLTSTWVTYGTCNQAGEHRRDKRSVALAPHVG prM/full MGLDTRTQTWMSAEGAWRQVEKVETWALRHPGFTTLALFLAHYIGTSITQKGIIFILL E) MLVTPSMAMRCVGIGNRDFVEGLSGATWVDVVLEHGSCVTTMAKNKPTLDIELLKTEV TNPAVLRKLCIEAKISNTTTDSRCPTQGEATLVEEQDTNFVCRRTFVDRGWGNGCGLF GKGSLITCAKFKCVTKLEGKIVQYELKYSVIVTVIINGDTHAVGNDTQGVTVEITPQA PTSEIQLTDYGALTLDCSPRTGLDFNRVVLLTMKKKSWLVHKQWFLDLPLPWTSGAST SQETWNRQDLLVTFKTAHAKKQEVVVLGSQEGAMHTALTGTTEIQTSGTTTIFAGHLK CRLKMDKLTLKGMSYVMCTGSFKLEKEVAETQHGTVLVQVKYEGTGAPCKVPIEIRDV NKEKVVGRIISSTPLAENTNSVTNIELEPPFGDSYIVIGVGDSALTLHWFRKGSSIGK MFESTARGARRMAILGDTAWDFGSICGVFTSVGKLIHQIFGTAYGVLFSCVSWTMKIG IGILLTWLGLNSRSTSLSMTCTAVGMVTLYLGVVVQA 145. 23F5 MRSAITLLCLIPTVMAFSLSTRDGEPLMIVAKHERGRPLLFKTTEGINKCTLIANDLG C15/full EMCEDTVTYKCPLLVNTEPEDIDCWCNLTSTWVMYGTCTQSGERRREKRSVALTPHSG prM/full MGLDTRTQTWMSAEGAWKHAQRVESWILPNPRFALLAGFMAYMIGQTGIQRTVFFVLM E) MLVAPSMAMRCVGIGNRDFVEGLSGGAWVDLVLEHGGCVTTMAKNKPTLDFELIKTEA TQPATLRKYCIEAKLTNTTTESRCPTQGEAILPEEQDQNYVCKHTYVDRGWGNGCGLF GKGSLVTCAKFQCLEPIEGKVVQHENLKYTVIITVHTGDQHQVGNDTQGVTVEITPQA STVEAILPEYGTLGLECSPRTGLDFNEMILLTMKNKTWLVHKQWFLDLPLPWTAGADT REVHWNHKERMVTFKVPHAKRQDVTVLGSQEGAMHSALAGATEVDSDGNIIMFAGHLK CRLKMDKLELKGMSYSMCTGKFRTVKETIAETQHGTIVMVQYEGDGSPCKIPLEIMDL EKRHVLGRLITVNPIVIDKEKPVNIEAEPPFGESYIVVGAGEKALKLSWFKKGSSTGK MFEATARGARRMAILGDTAWDFGSIGGVFTSVGKLIHQIFGTAYGVLFSGVSWTMKTG IGILLTWLGLNSRSTSLSMTCIAVGMVTLYLGVMVQA 146. 23G3 MKTSLCLMMMLPATLAFHLTTRNGEPHMIVGRQEKGKSLLFKTSAGVNMCTLIAMDLG (C15/full ELCEDTMTYKCPRMTEAEPEDIDCWCNLTSTWVTYGTCTQSGERRREKRSVALTPHSG prM/full MGLETRAETWMSSEGAWKQIQKVETWALRHPGFTVIALFLAHAIGTSITQKGIIFILL E) MLVTPSMAMPCVGIGNRDFVEGLSGATWVDVVLEHGGCVTTMAKNKPTLDIELQKTEA TQLATLRKLCIEGKITNITTDSRCPTQGEAILPEEQDQNYVCKHTYVDRGWGNGCGLF GKGSLVTCAKFQCLEPIEGKVVQHENLKYTVTTTVHTGDQHQVGNDTQGVTVETTPQA STVEAILPEYGTLGLECSPRTGLDFNEMILLTMKNKAWMVHRQWFFDLPLPWTSGATT ETPTWNRKELLVTFKNAHAKKQEVVVLGSQEGAIHTALTGATEIQTSGTTTIFAGHLK CRLKMDKLTLKGVSYVMCTGSFKLEKEVAETQHGTVLVQVKYEGTDAPCKIPFSSQDE KGVTQNGRLITANPIVIDKEKPVNIEAEPPFGESNIVIGTGDKALKTNWFKKGSSICK MFEATARGARRMAILGDTAWDFGSIGGVFTSIGKALHQVFCAIYGAAFSGVSWTNKIL IGVIITWIGMNSRSTSLSVSLTLVGVITLYLGAMVQA 147. 2/7-D1 MRSVTMILMLLPTALAFHLTTRGGEPTLIVSKQERGKSLLFKTSAGVNMCTLIAI4LG extended ELCEDTITYKCPRMTEAEPDDVDCWCNATDTWVTYGTCSQTGEHRRDKRSVALDPHVG wth D1 LGLETRTETWMSSEGAWKQIQKVETWALRHPGFTVIALFLAHAIGTSITQKGIIFILL to MLVTPSMAIRCVGIGNRDFVEGLSGATWVDVVLEHGSCVTTMAKNKPTLDIELIKTEA C15/full TQPATLRKYCIEAKLTNTTTESRCPTQGEPYLKEEQDQNYVCKHTYVDRGWGNGCGLF prM/full GKGSLITCAKFKCVTKLEGKIVQYENLKYSVIVTVHTGDQHQVGNETTEHGTIATITP E with QAPTSEIQLTDYGALTLDCSLPRTGLDFNRVVLLTMKKTWLVHKQWFLDLPLPWTAGA DEN-1 STSQETWNRKELLVTFKNAHAKKQEVVVLGSQECAMHTALTGATEVDSGDGNLLPTGH amino LKCRLKMDKLQLKGMSYSMCTGKFQIVKEIAETQHGTIVMVQYEGMDGAPCKIPFSTE acid DGQGKAHNGRLITANPIVIDKEKPVNIELEPPFGDSYIVVGAGDKALKTNWYKKGSSI resdues GKMFESTYRGAKPMAILGDTAWDFGSIGGVFTSVGKLIHQIFGTAYGVLFSGVSWTMK IGIGILLTWLGLNSRSTSLSMTCIAVGMVTLYLGVMVQA 148. 23H7 MTSLCLMMMLPATLAFHLTSRDGHEPLMIVAKHERGRPLLFKTTEGINKCTLIAMDLG (C15/full EMCEDTVTYKCPLLRQNEPDDVDCWCNATDTWVTYGTCSQTGEHRRDKRSVALDPHVG prM/full LGLETRTETWMSSEGAWKQIQKVETWALRHpGFTVIALFLAHAIGTSITQKGIIFILL E) MLVTPSMTNRCVGIGNRDFVEGLSGATWVDVVLEHGSCVTTNAKNKPTLDIELLKTEV TNPAVLRKLCIEAKISNTTTDSRCPTQGEATLVEEQDTNFVCRRTFVDRGWGNGCGLF GKGSLITCAKFKCLEPIEGKVVQHENLKYTVIITVHTGDQHQVGNDTQGVTVETTPQA STVEAILPEYGTLGLECSPRTGLDFNRVVLLTMKKKSWLVHKQWFLDLPLPWTSGAST SQETWNRQDLLVTFKTAHAKKQEVVVLGSQEGAMHTALTGATETQTSGTTTIFTGHLK CRLKMDKLTLKGVSYVTCTGSFKLEKEVAETQHGTVLVQVKYEGTDAPCKIPFSSQDE KGVTQNGRLITANPVVTKKEEPVNIEAEPPFGESNIVIGIGDKALKTNWYKKGSSIGK MFEATARGARRMAILGDTAWDFGSIGGVFTSVGKLIHQIFGTAYGVLFSGVSWTMKIC IGILLTWLGLNSRSTSLSMTCIAVGMVTLYLGAIVQA 149. DEN-1 MGIIFILLMLVTPSMAMRCVGIGNPDFVEGLSGATWVDVVLEHGSCVTTMAKNKPTLD PRM15/ IELLKTEVTNPAVLRKLCIEAKISNTTTDSRCPTQGEATLVEEQDTNFVCRRTFVDRG tRunc E- WGNGCGLFGKGSLITCAKFKCVTKLEGKIVQYENLKYSVIVTVHTGDQHQVGNETTEH (trun- GTIATITPQAPTSETQLTDYGALTLDCSPRTGLDFNRVVLLTMKKKSWLVHKQWFLDL cated PLPWTSGASTSQETWNRQDLLVTFKTAHAKKQEVVVLGSQEGAMHTALTGATEIQTSG envelope) TTTIFAGHLKCRLKMDKLTLKGVSYVMCTGSFKLEKEVAETQHGTVLVQVKYEGTDAP parent CKIPFSSQDEKGVTQNGRLITANPIVIDKEKPVNIEAEPPFGESYIVVGAGEKALKLS WFKKGSSIGKNFEATARGARRMAILGDTAWDLGSICGVFTSVGK 150. DEN-2 MGLILILQTAVAPSMTMRCIGISNRDFVEGVSGGSWVDIVLEHGSCVTTMAKNKPTLD PRM15/ FELIKTEATQPATLRKYCIEAKLTNTTTESRCPTQGEPSLNEEQDKRFVCKHSNVDRG trunc E WGNGCGLFGKGGIVTCAMFTCKKNMEGNIVQPENLEYTTVITPHSGEEHAVGNDTGKH parent GKEIKITPQSSITEAELTGYGTVTMECSPRTGLDFNEIVLLQMEDKAWLVHRQWFLDL PLPWLPGADTQGSNRIQKETLVTFKNPHAKKQDVVVLGSQEGAMHTALTGATEIQMSS GNLLFTGHLKCRLRMDKLQLKGMSYSMCTFGKFQIVKEIAETQHGTTVMVQYEGDGSP CKIPLEINDLEKRIVLGRLITVNPIVTEKDSPVNIEAEPPFGDSYTIIGVEPGQLKLH WFKKGSSIGQMFETTMRGAKPMAILGDTAWDFGSLGGVFTSTGKALHQVFGAIYGA 151. DEN-3 MVVIFILLMLVTPSMTNRCVGVGNRDFVEGLSGATWVDVVLEHGGCVTTMAKNKPTLD PRM15/ IELQKTEATQLATLRKLCIEGKITNITTDSRCPTQGEAILPEEQDQNYVCKHTYVDRG trunc E WGNGCGLFGKGSLVTCAKFQCLEPIEGKVVQHENLKYTVIITVHTGDQHQVGNDTQGV Parent TVEITPQASTVEAILPEYGTLGLECSPRTGLDFNEMILLTMKNKAWMVHRQWFFDLPL PWTSGATTETPTWNRKELLVTFKNAHAKKQEVVVLGSQEGAMHTALTGTTEIQNSGGT SIFAGHLKCRLKNDKLELKGMSYAMCLNTFVLKKEVSETQHGTILTKVEYKGEDAPCK IPFSTEDGQGKAHNGRLITANPVVTKKEEPVNIEAEPPFGESNTVTGIGDKALKINWY KKGSSIGKMFEATARGARRNAILGDTAWDFGSVGGVLNSLGKMVHQIFGSAYT 152. DEN-4 MTVFFILMMLVAPSYCMRCVGVGNRDFVEGVSGGAWVDLVLEHGGCVTTNAQGKPTLD FELIKTTAKEVALLRTYCIEASISNITTATRCPTQGEPYLKEEQDQQYICRRDVVDRG trunc E WGNGCGLFGKGGVVTCAKFSCSGKITGNLVQTENLEYTVVVTVHNGDTHAVGNDTSNH Parent GVTATITPRSPSVEVKLPDYGELTLDCEPRSGIDFNEMILMKMKKKTWLVHKQWFLDL PLPWTAGADTSEVHWNHKERMVTFKVPHAKRQDVTVLGSQEGAMHSALAGATEVDSGD GNHMFAGHLKCKVPMEKLRIKGMSYTMCSGKFSIDKEMAETQHCTTVVKVKYEGTGAP CKVPIEMDVNKEKVVGRIISSTPLAENTNSVTNIELEPPFGDSYRIVIGVGDSALTLH WFRKGSSIGKMFESTYRGAKRMAILGETAWDFGSVGGLLTSLGKAVHQVFGSVYTTMF G 153. 23D5 MRCVGIGNRDFVEGLSGATWVDVVLEHGSCVTTMAKNKPTLDIELLKTEVTNPAVLRK ETRUNC LCIEAKISNTTTDSRCPTQGEATLVEEQDTNFVCRRTFVDRGWGNGCGLFGKGSLITC (“trunc AKFKCVTKLEGKIVQYENLKYSVIVTVHNGDTHAVGNDTQGVTVEITPQAPTSEIQLT E” or DYGALTLDCSPRTCLDFNRVVLLTMKKKSWLVHKQWFLDLPLPWTSGASTSQETWNRQ “truncate DLLVTFKTAHAKKQEVVVLGSQEGAMITALTGTTEIQTSGTTTIFAGHLKCRLKMDKL d E) TLKGMSYVMCTGSFKLEKEVAETQHGTVLVQVKYEGTGAPCKVPIEIRDVNKEKVVGR only (no IISSTPLAENTNSVTNIELEPPFGDSYIVTCVCDSALTLHWFRKGSSIGKMFESTARC prM) ARRMAILGDTAWDFGSICGTFTSVCK 154. 23G3 MRCVGIGNRDFVEGLSGATWVDVVLEHGGCVTTMAKNKPTLDIELQKTEATQLATLRK ETRUNC LCIEGKITNITTDSRCPTQGEAILPEEQDQNYVCKHTYTDRGWCNGCGLFGKGSLVTC (“trunc AKFQCLEPIECKVVQHENLKYTVIITVHTGDQHQVGNDTQGVTVEITPQASTVEAILP E”) only EYGTLGLECSPRTGLDFNEMILLTMKNKAWMVHRQWFFDLPLPWTSGATTETPTWNRK (no prM) ELLVTFKNAHAKKQEVVVLGSQEGAMHTALTGATEIQTSGTTTTFAGHLKCRLKMDKL TLKGVSYVMCTGSFKLEKEVAETQHGTVLVQVKYEGTDAPCKIPFSSQDEKGVTQNGR LITANPIVIDKEKPVNIEAEPPFGESNTVTGIGDKALKINWFKKGSSIGKMFEATARG ARRMAILGDTAWDFGSIGGVFTSIGKALHQVFGAIYGA 155. 23H7 MRCVGIGNRDFVEGLSGATWVDVVLEHGSCVTTMAKNKPTLDIELLKTEVTIPAVLRK ETRUNC LCIEAKISNTTTDSRCPTQGEATLVEEQDTNFVCRRTFVDRGWGNGCGLFGKGSLITC only (no AKFKCLEPTEGKVVQHENLKYTVIITVHTGDQHQVGNDTQGVTVEITPQASTVEAILP prM) EYGTLGLECSPRTGLDFNRVVLLTMKKKSWLVHKQWFLDLPLPWTSGASTSQETWNRQ DLLVTFKTAHAKKQEVVVLGSQEGAMHTALTGATEIQTSGTTTIFTGHLKCRLKMDKL TLKGVSYVTCTGSFKLEKEVAETQHGTVLVQVKYEGTDAPCKIPFSSQDEKGVTQNGR LITANPVVTKKEEPVNIEAEPPFGESNIVIGIGDKALKINWYKKGSSTGKMFEATARG ARRMATLGDTAWDFGSTGGVFTSVGK 156. 2/7 atggtggtgatcttcatcctgctgatgctggtgaccccctccatgacaatgaggtgca Round I tcggcatcagcaacagggacttcgtggagggcgtgagcggcgccacctgggtggacgt shuffled ggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcccaccctggac clone atcgagctgatcaagaccgaggccacccagcccgccaccctgaggaagtactgcatcg aggccaagctgaccaataccaccaccgagagcaggtgcccoacccagggcgagcccta cctgaaggaggagcaggaccagaactacgtgtgcaagcacacctacgtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgcgccaagttca agtgcgtgaccaagctggagggcaagatcgtgcagtacgagaacctgaagtacagcgt gatcgtgaccgtgcacaccggcgaccagcaccaggtgggcaacgagaccaccgagcac ggcaccatcgccaccatcaccccccaggcccccaccagcgagatccagctgaccgact acggcgccctgaccctggactgcagccccaggaccggcctggacttcaacagggtggt gctgctgaccatgaagaagaagacctggctggtgcacaagcagtggttcctggacctg cccctgccctggaccgccggcgccagcaccagccaggagacctggaacaggaaggagc tgctggtgaccttcaagaacgcccacgccaagaagcaggaggtggtggtgctgggcag ccaggagggcgccatgcacaccgccctgaccggcgccaccgaggtggacagcggcgac ggcaacctgctgttcaccggccacctgaagtgcaggctgaagatggacaagctgcagc tgaagggcatgagctacagcatgtgcaccggcaagttccagatcgtgaaggagatcgc cgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcgacggcgccccc tgcaagatccccttcagcaccgaggacggccagggcaaggcccacaacggcaggctga tcaccgccaaccccatcgtgatcgacaaggagaagcccgtgaacatcgagctggagcc ccccttcggcgacagctacatcgtggtgggcgccggcgacaaggccctgaagatcaac tggtacaagaagggcagcagcatcggcaagatgttcgagagcacctacaggggcgcca agaggatggccatcctgggcgagaccgcctgggacttcggcagcgtgggcggcgtgtt caccagcatcggcaaggccctgcaccaggtgttcggcgccatctacggcgcc 157. 2G11 atgaccgtgttcttcatcctgctgatgctggtgaccccctctatggccatgaggtgcg Round I tgggcatcggcaaccgcgacttcgtggagggcctgagcggcgccacctgggtggacgt shuffled ggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcccaccctggac clone ttcgagctgatcaagaccaccgccaaggaggtggccctgctgaggacctactgcatcg aggccagcatcagcaacatcaccaccgccaccaggtgccccacccagggcgagcccta cctgaaggaggagcaggaccagcagtacatctgcaggagggacgtggtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgcgccaagttca agtgcgtgaccaagctggagggcaacatcgtgcagcccgagaacctggagtacaccat cgtgatcaccccccacaccggcgaccagcaccaggtgggcaacgacacccagggcgtg accgtggagatcaccccccaggccagcaccgtggaggccatcctgcccgagtacggca ccctgggcctggagtgcagccccaggactggcctggacttcaacgagatgatcctgct gaccatgaagaacaaggcctggatggtgcacaggcagtggttcttcgacctgcccctg ccctggaccagcggcgccaccaccgagacccccacctggaacaggaaggagctgctgg tgaccttcaagaacgcccacgccaagaagcaggaggtggtggtgctgggcagccagga gggcgccatgcacagcgccctggccggcgccaccgagatccagatgagcagcggcaac ctgctgttcaccggccacctgaagtgcaggctgaagatggacaagctgcagctgaagg gcatgagctacaccatgtgcaccggcaagttccagatcgtgaaggagatcgccgagac ccagcacggcaccatcctgatcaaggtggagtacaagggcgaggacgccccctgcaag atccccttcagcaccgaggacgagaagggcgtgacccagaacggcaggctgatcaccg ccaaccccatcgtgaccgagaaggacagccccgtgaacatcgaggccgagcccccctt cggcgagagctacatcgtggtgggcgccggcgagaaggccctgaagctgagctggttc aagaagggcagcagcatcggccagatgttcgagagcacctacaggggcgccaagagga tggccatcctgggcgagaccgcctgggacttcggcagcgtgggcggcctgctgaccag cctgggcaaggccgtgcaccaggtgttcggcagcgtgtacaccaccatgttcggc 158. 5/21 atgaccgtgttcttcatcctgatgatgctggtggccccctcctacggcatgaggtgcg Round I tgggcgtgggcaacagggacttcgtggagggcctgagcggcgccacctgggtggacgt shuffled ggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcccaccctggac clone atcgagctgctgaagaccgaggtgaccaaccccgccaccctgaggaagtactgcatcg aggccaagatcaccaacatcaccaccgacagcaggtgccccacccagggcgaggccat cctgcccgaggagcaggaccagaactacgtgtgcaagcacacctacgtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcctggtgacctgcgccaagttca cctgcaagaagaacatggagggcaacatcgtgcagcccgagaacctggagtacaccat cgtgatcaccccccacagcggcgaggagcacgccgtgggcaacgacactggcaagcac ggcaaggagatcaagatcaccccccagagcagcatcaccgaggccgagctgaccggct acggcaccgtgaccatggagtgcagccccaggaccggcctggacttcaacagggtggt gctgctgaccatgaagaagaagagctggctggtgcacaagcagtggttcctggacctg cccctgccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaaggaga ggatggtgaccttcaaggtgccccacgccaagaggcaggacgtgaccgtgctgggcag ccaggagggcgccatgcacagcgccctggccggcgccaccgagatccagatgagcagc ggcaacctgctgttcaccggccacctgaagtgcaggctgaggatggacaagctgcagc tgaagggcatgagctacgtgatgtgcaccggcaagttccagatcgtgaaggagatcgc cgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcgacggcagcccc tgcaagatccccctggagatcatggacctggagaagaggcacgtgctgggcaggctga tcaccgtgaaccccatcgtgatcgacaaggagaagcccgtgaacatcgaggccgagcc ccccttcggcgagagctacatcgtggtcggcgtgggcgacagcgccctgaccctgcac tggttcaggaagggcagcagcatcggccagatgttcgagaccaccatgaggggcgcca agaggatggccatcttgggcgacaccgcctgggacttcggcagcctgggcggcgtgtt caccagcatcggcaaggccctgcaccaggtgttcggcgccatctacggcgcc 159. 6E12 atgaccgtgttcttcatcctgatgatgctggtggccccctcctacggcatgaggtgcg Round I tgggcgtgggcaacagggacttcgtggagggcctgagcggcgccacctgggtggacgt shuffled ggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcccaccctggac clone atcgagctgctgaagaccgaggtgaccaaccccgccgtgctgaggaagctgtgcatcg aggccagcatcagcaacatcaccaccgccaccaggtgccccacccagggcgaggccat cctgcccgaggagcaggaccagaactacgtgtgcaagcacacctacgtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcgtggtgacctgcgccaagttcc agtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctgaagtacaccgt gatcatcaccgtgcacaccggcgaccagcaccaggtgggcaacgacacccagggcgtg accgtggagatcaccccccaggccagcaccgtggaggccatcctgcccgagtacggca ccctgggcctggagtgcagccccaggaccggcctggacttcaacgagatgatcctgct gaccatgaagaacaaggcctggatggtgcacaggcagtggttcttcgacctgcccctg ccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaaggagaggatgg tgaccttcaagaccgcccacgccaagaagcaggacgtggtggtgctgggcagccagga gggcgccatgcacaccgccctgaccggcgccaccgagatccagatgagcagcggcacc accaccatcttcgccggccacctgaagtgcaaggtgaggatggacaagctgaccctga agggcgtgagctacgtgatgtgcaccggcagcttcaagctggagaaggaggtggccga gacccagcacggcaccgtgctggtgcaggtgaagtacgagggcaccgacgccccctgc aagatccccttcagcagccaggacgagaagggcgtgacccagaacggcaggctgatca ccgccaaccccatcgtgatcgacaaggagaagcccgtgaacatcgaggccgagccccc ctttggcgacagctacatcatcatcggcgtggagcccggccagctgaagctgcactgg ttcaagaagggcagcagcatcggccagatgttcgagaccaccatgaggggcgccaaga ggatggccatcctgggcgacaccgcctgggacttcggcagcgtgggcggcctgctgac cagcctgggcaagatggtgcaccagatcttcggcagcgtgtacaccaccatgttcggc 160. 6C6 atgaccgtgttcttcatcctgatgatgctggtgaccccctctatggccatgaggtgcg Round I tgggcatcggcaaccgcgacttcgtggagggcgtgagcggcggcgcctgggtggacct shuffled ggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcccaccctggac clone ttcgagctgatcaagaccaccgccaaggaggtggccctgctgaggacctactgcatcg aggccagcatcagcaacatcaccaccgccaccaggtgccccacccagggcgagcccta cctgaaggaggagcaggaccagcagtacatctgcaggagggacgtggtggacaggggc tggggcaacggctgcggcctgttcggcaagggcggcatcgtgacctgcgccatgttca cctgcaagaagaacatggagggcaacatcgtgcagcccgagaacctggagtacaccat cgtgatcaccccccacagcggcgaggagcacgccgtgggcaacgacactggcaagcac ggcaaggagatcaagatcaccccccagagcagcatcaccgaggccgagctgaccggct acggcaccgtgaccatggagtgcagccccaggaccggcctggacttcaacgagatgat cctgatgaagatgaagaacaaggcctggatggtgcacaggcagtggttcctggacctg tgcctgccctggaccgccggcgccgacaccgagacccccacctggaacaggaaggagc tgctggtgaccttcaagaacgcccacgccaagaagcaggaggtggtggtgctgggcag ccaggagggcgccatgcacagcgccctggccggcgccaccgaggtggacagcggcgac ggcaaccacatgttcgccggccacctgaagtgcaaggtgaggatggacaagctgcagc tgaagggcatgagctacagcatgtgcaccggcaagttccagatcgtgaaggagatcgc cgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcgacggcagcccc tgcaagatccccctggagatcatggacctggagaagaggcacgtgctgggcaggctga tcactgtgaaccccatcgtgaccgagaaggacagccccgtgaacatcgaggccgagcc ccccttcggcgacagctacatcatcatcggcgtggagcccggccagctgaagctgcac tggttcaagaagggcagcagcatcggcaagatgttcgaggccaccgccaggggcgcca ggaggatggccatcctgggcgacaccgcctgggacttcggcagcgcctacacc 161. 6F4 atggccgtgttcttcatcctgctgatgctggtgaccccctccatgacaatgaggtgcg Round I tgggcatcggcaaccgcgacttcgtggagggcgtgagcggcggcgcctgggtggacct shuffled ggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcccaccctggac clone ttcgagctgatcaagaccgaggccacccagcccgccaccctgaggaagtactgcatcg aggccaagctgaccaataccaccaccgagagcaggtgccccacccagggcgaggccat cctgcccgaggagcaggaccagcagtacatctgcaggagggacgtggtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgcgccaagttcc agtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctgaagtacaccgt gatcatcaccgtgcacaccggcgaccagcaccaggtgggcaacgacacccagggcgtg accgtggagatcaccccccaggccagcaccgtggaggccatcctgcccgagtacggca ccctgggcctggagtgcagccccaggaccggcctggacttcaacgagatgatcctgct gaccatgaagaagaagagctggctggtgcacaagcagtggttcctggacctgcccctg ccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaaggagaggatgg tgaccttcaaggtgccccacgccaagaggcaggacgtgaccgtgctgggcagccagga gggcgccatgcacagcgccctggccggcgccaccgaggtggacagcggcgacggcaac cacatgttcgccggccacctgaagtgcaaggtgaggatggagaagctgcgcatcaagg gcatgagctacagcatgtgcaccggcaagttccagatcgtgaaggagatcgccgagac ccagcacggcaccatcgtgatcagggtgcagtacgagggcgacggcagcccctgcaag atccccctggagatcatggacctggagaagaggcacgtgctgggcaggctgatcaccg tgaaccccatcgtgaccgagaaggacagccccgtgaacatcgaggccgagcccccctt cggcgacagctacatcatcatcggcgtggagcccggccagctgaagctgcactggttc aagaagggcagcagcatcggccagatgttcgaggccaccgccaggggcgccaggagga tggccatcctgggcgacaccgcctgggacttcggcagcgtgggcggcgtgttcaccag catcggcaaggccctgcaccaggtgttcggcgccatctacggcgcc 162. 7A9 atgaccgtgttcttcatcctgatgatgctggtggccccctcctacgccatgaggtgcg Round I tgggcatcggcaaccgcgacttcgtggagggcgtgagcggcgccacctgggtggacgt shuffled ggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcccaccctggac clone atcgagctgcagaagaccgaggccacccagctggccaccctgaggaagctgtgcatcg aggccagcatcagcaacatcaccaccgccaccaggtgccccacccagggcgaggccac cctggtggaggagcaggacaccaacttcgtgtgcaggaggaccttcgtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcctggtgacctgcgccaagttcc agtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctgaagtacaccat cgtgatcaccccccacagcggcgaggagcacgccgtgggcaacgacactggcaagcac ggcaaggagatcaagatcaccccccagagcagcatcaccgaggccgagctgaccggct acggcaccgtgaccatggagtgcagccccaggaccggcctggacttcaacgagatgat cctgctgaccatgaagagcaagacctggctggtgcacaagcagtggttcctggacctg cccctgccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaaggaga ggatggtgaccttcaagaacgcccacgccaagaagcaggaggtggtggtgctgggcag ccaggagggcgccatgcacaccgccctgaccggcaccaccgagatccagaacagcggc aacctgctgttcaccggccacctgaagtgcaggctgaggatggacaagctgcagctga agggcatgagctacagcatgtgcaccggcaagttccagatcgtgaaggagatcgccga gacccagcacggcaccatcgtgatcagggtgcagtacgagggcgacggcagcccctgc aagatccccctggagatcatggacctggagaagaggcacgtgctgggcaggctgatca ccgtgaaccccatcgtgaccgagaaggacagccccgtgaacatcgaggccgagccccc cttcggcgacagctacatcatcatcggcgtggagcccggccagctgaagctgcactgg ttcaggaagggcagcagcatcggcaagatgttcgagagcacctacaggggcgccaaga ggatggccatcctgggcgagaccgcctgggacttcggcagcgtgggcggcctgctgac cagcctgggcaaggccgtgcaccaggtgttcggcagcgtgtacaccaccatgttcggc 1.63 11E2 atgaccgtgttcttcatcctgctgatgctggtggccccctcctacggcatgaggtgcg Round II tgggcgtgggcaacagggacttcgtggagggcctgagcggcgccacctgggtggacgt shuffled ggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcccaccctggac clone atcgagctgctgaagaccgaggtgaccaaccccgccgtgctgaggaagctgtgcatcg aggccagcatcagcaacatcaccaccgccaccaggtgccccacccagggcgaggccat cctgcccgaggagcaggaccagaactacgtgtgcaagcacacctacgtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcctggtgacctgcgccaagttca agtgcgtgaccaagctggagggcaagatcgtgcagtacgagaacctgaagtacagcgt gatcgtgaccgtgcacaccggcgaccagcaccaggtgggcaacgagaccaccgagcac ggcaccatcgccaccatcaccccccaggcccccaccagcgagatccagctgaccgact acggcgccctgaccctggactgcagccccaggaccggcctggacttcaacagggtggt gctactgaccatgaagaagaagacctggctggtgcacaagcagtggttcctggacctg cccctgccctggaccgccggcgccagcaccagccaggagacctggaacaggaaggagc tgctggtgaccttcaagaacgcccacgccaagaggcaggacgtgaccgtgctgggcag ccaggagggcgccatgcacagcgccctgaccggcgccaccgagatccagatgagcagc ggcaacctgctgttcgccggccacctgaagtgcaggctgaagatggacaagctgcagc tgaagggcatgagctacagcatgtgcaccggcaagttccagatcgtgaaggagatcgc cgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcgacggcgccccc tgcaagatccccctggagatcatggacctggagaagaggcacgtgctgggcaggctga tcaccgtgaaccccatcgtgaccgagaaggacagccccgtgaacatcgaggccgagcc ccccttcggcgagagctacatcgtggtcggcgtgggcgacagcgccctgaccctgcac tggttcaggaagggcagcagcatcggccagatgttcgaggccaccgccaggggcgcca agaggatggccatcctgggcgacaccgcctgggacttcggcagcgtgggcggcgtgtt caccagcatcggcaaggccctgcaccaggtgttcggcgccatctacggcgcc 164. 2E3 atggccgtgttcttcatcctgatgatgctggtggccccctcctacggcatgaggtgcg Round II tgggcgtgggcaacagggacttcgtggagggcctgagcggcgccacctgggtggacgt shuffled ggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcccaccctggac clone atcgagctgatcaagaccaccgccaaggaggtggccctgctgaggacctactgcatcg aggccagcatcagcaacatcaccaccgccaccaggtgccccacccagggcgaggccat cctgcccgaggagcaggaccagaactacgtgtgcaagcacacctacgtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcctggtgacctgcgccaagttca cctgcaagaagaacatggagggcaacatcgtgcagcccgagaacctggagtacaccat cgtgatcaccccccacagcggcgaggagcacgccgtgggcaacgacactggcaagcac ggcaaggagatcaagatcaccccccagagcagcatcaccgaggccgagctgaccggct acggcaccgtgaccatggagtgcagccccaggaccggcctggacttcaacagggtggt gctgctgaccatgaagaagaagagctggctggtgcacaagcagtgqttcctggacctg cccctgccctggaccgccggcgccagcaccagccaggagacctggaacaggaaggagc tgctggtgaccttcaagaacgcccacgccaagaagcaggaggtggtggtgctgggcag ccaggagggcgccatgcacaccgccctgaccggcgccaccgagatccagatgagcagc ggcaacctgctgttcaccggccacctgaagtgcaggctgaggatggacaagctgcagc tgaagggcatgagctacagcatgtgcaccggcaagttccagatcgtgaaggagatcgc cgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcgacggcgccccc tgcaagatccccttcagcaccgaggacggccagggcaaggcccacaacggcaggctga tcaccgccaaccccatcgtgatcgacaaggagaagcccgtgaacatcgagctggagcc ccccttcggcgagagctacatcgtggtgggcgccggcgacaaggccctgaagatcaac tggtacaagaagggcagcagcatcggcaagatgttcgagagcacctacaggggcgcca agaggatggccatcctgggcgagaccgcctgggacttcggcagcgtgggcggcctgct gaccagcctgggcaaggccgtgcaccaggtgttcggcagcgtgtacaccaccatgttc ggc 165. 13E2 atgaccgtgttcttcatcctgatgatgctggtgaccccctctatggccatgaggtgcg Round II tgggcatcggcaaccgcgacttcgtggagggcgtgagcggcgccacctgggtggacgt shuffled ggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcccaccctggac clone ttcgagctgatcaagaccaccgccaaggaggtggccctgctgaggacctactgcatcg aggccagcatcagcaacatcaccaccgccaccaggtgccccacccagggcgagcccta cctgaaggaggagcaggaccagcagtacatctgcaggagggacgtggtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgcgccaagttca agtgcgtgaccaagctggagggcaacatcgtgcagcccgagaacctggagtacaccat cgtgatcaccccccacagcggcgaggagcacgccgtgggcaacgacactggcaagcac ggcaaggagatcaagatcaccccccagagcagcatcaccgaggccgagctgaccggct acggcaccgtgaccatggagtgcagccccaggaccggcctggacttcaacgagatgat cctgctgaccatgaagaagaagagctggctggtgcacaagcagtggttcctggacctg cccctgccctggaccgccggcgccagcaccagccaggagacctggaacaggaaggagc tgctggtgaccttcaagaacgcccacgccaagaagcaggaggtggtggtgctgggcag ccaggagggcgccatgcacaccgccctgaccggcgccaccgagatccagatgagcagc ggcaacctgctgttcaccggccacctgaagtgcaggctgaggatggacaagctgcagc tgaagggcatgagctacaccatgtgcaccggcaagttccagatcgtgaaggagatcgc cgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcaccgacgccccc tgcaagatccccttcagcaccgaggacggccagggcaaggcccacaacggcaggctga tcaccgccaaccccatcgtgatcgacaaggagaagcccgtgaacatcgagctggagcc ccccttcggcgacagctacatcatcatcggcgtggagcccggccagctgaagctgcac tggttcaagaagggcagcagcatcggcaagatgttcgaggccaccgccaggggcgcca agaggatggccatcctgggcgagaccgcctgggacttcggcagcgtgggcggcctgct gaccagcctgggcaagatggtgcaccagatcttcggcagcgtgtacaccaccatgttc ggc 166. 1439 atgaccgtgttcttcatcctgctgatgctggtgaccccctctatggccatgaggtgcg Round II tgggcatcggcaaccgcgacttcgtggagggcgtgagcggcggcgcctgggtggacct shuffled ggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcccaccctggac clone atcgagctgatcaagaccgaggccacccagcccgccaccctgaggaagtactgcatcg aggccaagatcaccaacatcaccaccgacagcaggtgccccacccagggcgaggccat cctgcccgaggagcaggaccagcagtacatctgcaggagggacgtggtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcgtggtgacctgcgccaagttcc agtgcgtgaccaagctggagggcaacatcgtgcagcccgagaacctggagtacaccat cgtgatcaccccccacaccggcgaccagcaccaggtgggcaacgacacccagggcgtg accgtggagatcaccccccaggccagcaccgtggaggccatcctgcccgagtacggca ccctgggcctggagtgcagccccaggaccggcctggacttcaacgagatgatcctgat gaagatgaagaacaaggcctggatggtgcacaagcagtggttcctggacctgcccctg ccctggaccagcggcgccagcaccagccaggagacctggaacaggaaggagctgctgg tgaccttcaagaacgcccacgccaagaagcaggaggtggtggtgctgggcagccagga gggcgccatgcacagcgccctggccggcgccaccgagatccagatgagcagcggcaac ctgctgttcaccggccacctgaagtgcaggctgaagatggacaagctgcagctgaagg gcatgagctacaccatgtgcaccggcaagttccagatcgtgaaggagatcgccgagac ccagcacggcaccatcgtgatcagggtgcagtacgagggcgacggcagcccctgcaag atccccctggagatcatggacctggagaagaggcacgtgctgggcaggctgatcaccg ccaaccccatcgtgatcgacaaggagaagcccgtgaacatcgagctggagcccccctt cggcgacagctacatcgtggtgggcgccggcgacaaggccctgaagatcaactggtac aagaagggcagcagcatcggcaagatgttcgagagcacctacaggggcgccaagagga tggccatcctgggcgagaccgcctgggacttcggcagcgcctacacc 167. 13E11 atgaccgtgttcttcatcctgatgatgctggtgaccccctctatggccatgaggtgcg Round II tgggcgtgggcaacagggacttcgtggagggcgtgagcggcggcgcctgggtggacgt shuffled ggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcccaccctggac clone atcgagctgatcaagaccgaggccacccagcccgccaccctgaggaagtactgcatcg aggccagcatcagcaacatcaccaccgacaccaggtgccccacccagggtgagcccta cctgaaggaggagcaggaccagaactacgtgtgcaagcacacctacgtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgcgccaagttca agtgcgtgaccaagctggagggcaagatcgtgcagtacgagaacctgaagtacagcgt gatcgtgaccgtgcacaccggcgaccagcaccaggtgggcaacgagaccaccgagcac ggcaccatcgccaccatcaccccccaggcccccaccagcgagatccagctgaccgact acggcaccgtgaccatggagtgcagccccaggaccggcctggacttcaacgagatgat cctgctgaccatgaagagcaagacctggctagtgcacaagcagtggttcctggacctg cccctgccctggaccgccggcgccgacaccgagacccccacctggaacaggaaggagc tgctggtgaccttcaagaacgcccacgccaagaagcaggaggtggtggtgctgggcag ccaggagggcgccatgcacaccgccctggccggcgccaccgagatccagatgagcagc ggcaacctgctgttcaccggccacctgaagtgcaggctgaagatggacaagctgcagc tgaagggcatgagctacaccatgtgcaccggcaagttccagatcgtgaaggagatcgc cgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcaccgacgccccc tgcaagatccccttcagcaccgaggacggccagggcaaggcccacaacggcaggctga tcaccgccaaccccatcgtgatcgacaaggagaagcccgtgaacatcgagctggagcc ccccttcggcgagagctacatcgtggtgggcgccggcgagaaggccctgaccctgcac tggttcaggaagggcagcagcatcggccagatgttcgagaccaccatgaggggcgcca agaggatggccatcttgggcgagaccgcctgggacttcggcagcgtgggcggcgtgtt caccagcatcggcaaggccctgcaccaggtgttcggcgccatctacggcgcc 168. 16E8 atgaccgtgttcttcatcctgatgatgctggtggccccctcctacggcatgaggtgcg Round II tgggcgtgggcaacagggacttcgtggagggcctgagcggcgccacctgggtggacgt shuffled ggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcccaccctggac clone ttcgagctgatcaagaccaccgccaaggaggtggccctgctgaggacctactgcatcg aggccagcatcagcaacatcaccaccgccaccaggtgccccacccagggcgagcccta cctgaaggaggagcaggaccagcagtacatctgcaggagggacgtggtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgcgccaagttca agtgcgtgaccaagctggagggcaacatcgtgcagcccgagaacctggagtacaccat cgtgatcaccccccacaccggcgaccagcaccaggtgggcaacgacacccagggcgtg accgtggagatcaccccccaggccagcaccgtggaggccatcctgcccgagtacggca ccctgggcctggagtgcagccccaggaccggcctggacttcaacgagatgatcctgct gaccatgaagaacaaggcctggatggtgcacaagcagtggttcctggacctgcccctg ccctggaccagcggcgccaccaccgagacccccacctggaacaggaaggagctgctgg tgaccttcaagaccgcccacgccaagaagcaggacgtggtggtgctgggcagccagga gggcgccatgcacaccgccctgaccggcgccaccgagatccagatgagcagcggcaac ctgctgttcaccggccacctgaagtgcaggctgaggatggacaagctgcagctgaagg gcatgagctacgtgatgtgcaccggcaagttccagatcgtgaaggagatcgccgagac ccagcacggcaccatcctgatcaaggtggagtacaagggcgaggacgccccctgcaag atccccttcagcaccgaggacggccagggcaaggcccacaacggcaggctgatcaccg ccaaccccatcgtgatcgacaaggagaagcccgtgaacatcgaggccgagcccccctt cggcgagagcaacatcgtgatcggcatcggcgacaaggccctgaagatcaactggtac aagaagggcagcagcatcggcaagatgttcgagaccaccatgaggggcgccaagagga tggccatcttgggcgacaccgcctgggacttcggcagcgtgggcggcgtgttcaccag catcggcaaggccctgcaccaggtgttcggcgccatctacggcgcc 169. 16E10 atgaccgtgttcttcatcctgatgatgctggtggccccctcctacggcatgaggtgca Round II tcggcatcagcaacagggacttcgtggagggcgtgagcggcgccacctgggtggacgt shuffled ggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcccaccctggac clone atcgagctgctgaagaccgaggtgaccaaccccgccaccctgaggaagtactgcatcg aggccaaaatcagcaacaccaccaccgacagcaggtgccccacccagggcgaggccac cctggtggaggagcaggaccagaactacgtgtgcaagcacacctacgtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgcgccaagttca agtgcgtgaccaagctggagggcaagatcgtgcagtacgagaacctgaagtacagcgt gatcatcaccgtgcacaccggcgaccagcaccaggtgggcaacgacacccagggcgtg accgtggagatcaccccccaggccagcaccgtggaggccatcctgcccgagtacggcg ccctgggcctggagtgcagccccaggaccggcctggacttcaacgagatgatcctgct gaccatgaagaacaaggcctggatggtgcacgggcagtggttcttcgacctgcccctg ccctggaccagcggcgccaccaccgagacccccacctggaacaggaaggagctgctgg tgaccttcaagaacgcccacgccaagaagcaggaggtggtggtgctgggcagccagga gggcgccatgcacagcgccctggccggcgccaccgagatccagatgagcagcggcaac ctgctgttcaccggccacctgaagtgcaggctgaggatggacaagctgcagctgaagg gcatgagctacagcatgtgcaccggcaagttccagatcgtgaaggagatcgccgagac ccagcacggcaccatcgtgatcagggtgcagtacgagggcgacggcagcccctgcaag atccccctggagatcatggacctggagaagaggcacgtgctgggcaggctgatcaccg tgaaccccatcgtgatcgacaaggagaagcccgtgaacatcgaggccgagcccccctt cggcgagagctacatcgtggtcggcgtgggcgacagcgccctgaccctgcactggttc aggaagggcagcagcatcggccagatgttcgagaccaccatgaggggcgccaagagga tggccatcttgggcgacaccgcctgggacttcggcagcgtgggcggcgtgctgaacag cctgggcaagatggtgcaccagatcttcggcagcgtgtacaccaccatgttcggc 170. 18E9 atgaccgtgttcttcatcctgatgatgctggtggccccctcctacggcatgaggtgcg Round II tgggcgtgggcaacagggacttcgtggagggcctgagcggcgccacctgggtggacgt shuffled ggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcccaccctggac clone ttcgagctgatcaagaccgaggccacccagcccgccaccctgaggaagtactgcatcg aggccaagctgaccaataccaccaccgccaccaggtgccccacccagggcgagcccta cctgaaggaggagcaggaccagaactacgtgtgcaagcacacctacgtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcgtggtgacctgcgccaagttcc agtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctgaagtacagcgt gatcgtgaccgtgcacaccggcgaccagcaccaggtgggcaacgacacccagggcgtg accgtggagatcaccccccaggccagcaccgtggaggccatcctgcccgagtacggca ccctgggcctggagtgcagccccaggaccggcctggacttcaacgagatgatcctgct gaccatgaagaagaagagctggctggtgcacaagcagtggttcctggacctgcccctg ccctggaccagcggcgccagcaccagccaggagacctggaacaggaaggagctgctgg tgaccttcaagaacgcccacgccaagaggcaggacgtgaccgtgctgggcagccagga gggcgccatgcacaccgccctgaccggcgccaccgaggtggacagcggcgacggcaac ctgctgttcaccggccacctgaagtgcaggctgaagatggacaagctgcagctgaagg gcatgagctacaccatgtgcaccggcaagttccagatcgtgaaggagatcgccgagac ccagcacggcaccatcctgatcaaggtggagtacaagggcgaggacgccccctgcaag atccccttcagcagccaggacgagaagggcgtgacccagaacggcaggctgatcaccg ccaaccccatcgtgatcgacaaggagaagcccgtgaacatcgagctggagcccccctt cggcgacagctacatcgtggtgggcgccggcgacaaggccctgaagatcaactggtac aagaagggcagcagcatcggcaagatgttcgagagcacctacaggggcgccaagagga tggccatcctgggcgacaccgcctgggacttcggcagcgtgggcggcgtgctgaacag cctgggcaagatggtgcaccagatcttcggcagcgtgtacaccaccatgttcggc 171. 18E10 atgaccgtgttcttcatcctgctgatgctggtgaccccctctatggccatgaggtgcg Round II tgggcgtgggcaacagggacttcgtggagggcctgagcggcgccacctgggtggacgt shuffled ggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcccaccctggac clone atcgagctgcagaagaccgaggccacccagctggccaccctgaggaagtactgcatcg aggccaagctgaccaataccaccaccgagagcaggtgccccacccagggcgaggccat cctgcccgaggagcaggaccagaactacgtgtgcaagcacacctacgtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgcgccaagttca agtgcgtgaccaagctggagggcaagatcgtgcagtacgagaacctgaagtacagcgt gatcgtgaccgtgcacaccggcgaccagcaccaggtgggcaacgagaccaccgagcac ggcaccatcgccaccatcaccccccaggcccccaccagcgagatccagctgaccgact acggcgccctgaccctggactgcagccccaggaccggcctggacttcaacagggtggt gctgctgaccatgaagaagaagagctggctggtgcacaagcagtggttcctggacctg cccctgccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaaggaga ggatggtgaccttcaagaccgcccacgccaagaagcaggaggtggtggtgctgggcag ccaggagggcgccatgcacaccgccctgaccggcgccaccgaggtggacagcggcgac ggcaacctgctgttcaccggccacctgaagtgcaggctgaggatggacaagctgcagc tgaagggcatgagctacagcatgtgcaccggcaagttccagatcgtgaaggagatcgc cgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcgacggcagcccc tgcaagatccccctggagatcatggacctggagaagaggcacgtgctgggcaggctga tcaccgccaaccccatcgtgatcgacaaggagaagcccgtgaacatcgagctggagcc ccccttcggcgacagctacatcgtggtgggcgccggcgacaaggccctgaagatcaac tggtacaagaagggcagcagcatcggccagatgttcgagaccaccatgaggggcgcca agaggatggccatcttgggcgacaccgcctgggacttcggcagcgtgggcggcctgct gaccagcctgggcaaggccgtgcaccaggtgttcggcagcgtgtacaccaccatgttc ggc 172. 18E11 atgaccgtgttcttcatcctgatgatgctggtggccccctcctacggcatgaggtgcg Round II tgggcgtgggcaacagggacttcgtggagggcctgagcggcgccacctgggtggacgt shuffled ggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcccaccctggac clone ttcgagctgatcaagaccaccgtcaaggaggtggccctgctgaggacctactgcatcg aggccagcatcagcaacatcaccaccgccaccaggtgccccacccagggcgagcccta cctgaaggaggagcaggaccagcagtacatctgcaggagggacgtggtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgcgccaagttca agtgcgtgaccaagctggagggcaacatcgtgcagcccgagaacctggagtacaccat cgtgatcaccccccacaccggcgaccagcaccaggtgggcaacgacacccagggcgtg accgtggagatcaccccccaggccagcaccgtggaggccatcctgcccgagtacggca ccctgggcctggagtgcagccccaggactggcctggacttcaacgagatgatcctgct gaccatgaagaagaagagctggctggtgcacaagcagtggttcctggacctgcccctg ccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaaggagaggatgg tgaccttcaagaccgcccacgccaagaagcaggacgtggtggtgctgggcagccagga gggcgccatgcacaccgccctgaccggcgccaccgaggtggacagcggcgacggcaac ctgctgttcaccggccacctgaagtgcaggctgaagatggacaagctgcagctgaagg gcatgagctacagcatgtgcaccggcaagttccagatcgtgaaggagatcgccgagac ccagcacggcaccatcctgatcaaggtggagtacaagggcgaggacgccccctgcaag atccccttcagcaccgaggacgagaagggcaaggcccacaacggcaggctgatcaccg ccaaccccatcgtgatcgacaaggagaagcccgtgaacatcgaggccgagcccccctt cggcgagagctacatcgtggtgggcgccggcgacaaggccctgaagctgagctggttc aagaagggcagcagcatcggccagatgttcgagagcacctacaggggcgccaagagga tggccatcctgggcgagaccgcctgggacttcggcagcgtgggcggcgtgttcaccag catcggcaaggccctgcaccaggtgttcggcgccatctacggcgcc 173. 11B1 atggtggtgatcttcatcctgctgatgctggtgaccccctccatgacaatgaggtgca Round II tcggcatcagcaacagggacttcgtggagggcgtgagcggcgccacctgggtggacgt shuffled ggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcccaccctggac clone atcgagctgatcaagaccgaggccacccagcccgccaccctgaggaagtactgcatcg aggccaagctgaccaataccaccaccgccaccaggtgccccacccagggcgaggccat cctgcccgaggagcaggaccagaactacgtgtgcaagcacacctacgtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcgtggtgacctgcgccaagttcc agtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctgaagtacaccgt gatcatcaccgtgcacaccggcgaccagcaccaggtgggcaacgatacccagggcgtg accgtggagatcaccccccaggccagcaccgtggaggccatcctgcccgagtacggca ccctgggcctggagtgtagccccaggaccggcctggacttcaacgagatgatcctgct gaccatgaagaacaaggcctggatggtgcacaggcagtggttcttcgacctgcccctg ccctggaccagcggcgccaccaccgagacccccacctggaaccacaaggagaggatgg tgaccttcaaggtgccccacgccaagaggcaggacgtgaccgtgctgggcagccagga gggcgccatgcacaccgccctgaccggcgccaccgaggtggacagcggcgacggcaac cacatgttcgccggccacctgaagtgcaggctgaagatggacaagctgcagctgaagg gcatgagctacagcatgtgcaccggcaagttccagatcgtgaaggagatcgccgagac ccagcacggcaccatcgtgatcagggtgcagtacgagggcgacggcagcccctgcaag atccccctggagatcatggacctggagaagaggcacgtgctgggcaggctgatcaccg ccaaccccatcgtgaccgagaaggacagccccgtgaacatcgaggccgagcccccctt cggcgacagctacatcatcatcggcgtggagcccggccagctgaagctgcactggttc aggaagggcagcagcatcggccagatgttcgagagcacctacaggggcgccaagagga tggccatcctgggcgagaccgcctgggacttcggcagcgtgggcggcctgctgaccag cctgggcaaggccgtgcaccaggtgttcggcgccatctacggcgcc 174. 11B8 atgaccgtgttcttcatcctgctgatgctggtggccccctcctacggcatgaggtgcg Round II tgggcgtgggcaacagggacttcgtggagggcctgagcggcgccacctgggtggacgt shuffled ggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcccaccctggac clone atcgagctgctgaagaccgaggtgaccaaccccgccaccctgaggaagtactgcatcg aggccaagctgaccaataccaccaccgagagcaggtgccccacccagggcgagcccta cctgaaggaggagcaggaccagaactacgtgtgcaagcacacctacgtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcgtggtgacctgcgccaagttcc agtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctgaagtacaccgt gatcatcaccgtgcacaccggcgaccagcaccaggtgggcaacgacacccagggcgtg accgtggagatcaccccccaggccagcaccgtggaggccatcctgcccgagtacggca ccctgggcctggagtgcagccccaggaccggcctggacttcaacagggtggtgctgct gaccatgaagaagaagacctggctggtgcacaagcagtggttcctggacctgcccctg ccctggaccgccggcgccgacaccgagacccccacctggaacaggaaggagctgctgg tgaccttcaagaacgcccacgccaagaagcaggaggtggtggtgctgggcagccagga gggcgccatgcacagcgccctggccggcgccaccgagatccagatgagcagcggcaac cacatgttcgccggccacctgaagtgcaaggtgaggatggagaagctgcagctgaagg gcatgagctacgtgatgtgcaccggcaagttccagatcgtgaaggagatcgccgagac ccagcacggcaccatcgtgatcagggtgcagtacgagggcgacggcgccccctgcaag atccccttcagcaccgaggacggccagggcaaggcccacaacggcaggctgatcaccg ccaaccccatcgtgatcgacaaggagaagcccgtgaacatcgagctggagcccccctt cggcgacagctacatcgtggtgggcgccggcgacaaggccctgaagatcaactggtac aagaagggcagcagcatcggccagatgttcgaggccaccgccaggggcgccaggagga tggccatcctgggcgacaccgcctgggacttcggcagcgtgggcggcctgctgaccag cctggcaaggccctgcacaggtgttcggcgccatctacggcgcc 175. 11C4 atgaccgtgttcttcatcctgatgatgctggtggccccctcctacggcatgaggtgcg Round II tgggcatcggcaaccgcgacttcgtggagggcctgagcggcgccacctgggtggacgt shuffled ggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcccaccctggac clone atcgagctgctgaagaccgaggtgaccaaccccgccaccctgaggacctactgcatcg aggccagcatcagcaacatcaccaccgagagcaggtgccccacccagggcgagcccta cctgaaggaggagcaggaccagaactacgtgtgcaagcacacctacgtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcgtggtgacctgcgccaagttcc agtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctgaagtacaccat cgtgatcaccccccacagcggcgaggagcacgccgtgggcaacgacactggcaagcac ggcaaggagatcaagatcaccccccagagcagcatcaccgaggccgagctgaccggct acggcaccgtgaccatggagtgcagccccaggaccggcctggacttcaacgagatgat cctgctgaccatgaagagcaagacctggctggtgcacaagcagtggttcctggacctg cccctgccctggaccgccggcgccgacaccagccaggagacctggaacaggaaggagc tgctggtgaccttcaagaacgcccacgccaagaagcaggaggtggtggtgctgggcag ccaggagggcgccatgcacaccgccctgaccggcgccaccgaggtggacagcggcgac ggcaacctgctgttcaccggccacctgaagtgcaggctgaagatggacaagctgcagc tgaagggcatgagctacagcatgtgcaccggcaagttccagatcgtgaaggagatcgc cgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcgacggcagcccc tgcaagatccccctggagatcatggacctggagaagaggcacgtgctgggcaggctga tcaccgtgaaccccatcgtgaccgagaaggacagccccgtgaacatcgaggccgagcc ccccttcggcgacagctacatcatcatcggcgtgggcgacaaggccctgaagctgcac tggttcaagaagggcagcagcatcggcaagatgttcgaggccaccgccaggggcgcca agaggatggccatcctgggcgagaccgcctgggacttcggcagcgcctacacc 176. 11C11 atgaccgtgttcttcatcctgatgatgctggtggccccctcctacggcatgaggtgcg Round II tgggcgtgggcaacagggacttcgtggagggcctgagcggcgccacctgggtggacgt shuffled ggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcccaccctggac clone atcgagctgctgaagaccgaggtgaccaaccccgccgtgctgaggaagctgtgcatcg aggccagcatcagcaacatcaccaccgagagcaggtgccccacccagggcgagcccta cctgaaggaggagcaggaccagaactacgtgtgcaagcacacctacgtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgcgccaagttca agtgcgtgaccaagctggagggcaagatcgtgcagtacgagaacctgaagtacagcgt gatcgtgaccgtgcacaccggcgaccagcaccaggtgggcaacgagacccagggcgtg accgtggagatcaccccccaggccagcaccgtggaggccatcctgcccgagtacggca ccctgggcctggagtgcagccccaggaccggcctggacttcaacgagatgatcctgct gaccatgaagaagaagacctggctggtgcacaagcagtggttcctggacctgcccctg ccctggaccgccggcgccgacaccagccaggagacctggaacaggaaggagctgctgg tgaccttcaagaacgcccacgccaagaagcaggaggtggtggtgctgggcagccagga gggcgccatgcacaccgccctgaccggcgccaccgaggtggacagcggcgacggcaac ctgctgttcaccggccacctgaagtgcaggctgaagatggacaagctgcagctgaagg gcatgagctacagcatgtgcaccggcaagttccagatcgtgaaggagatcgccgagac ccagcacggcaccatcgtgatcagggtgcagtacgagggcgaggacgccccctgcaag atccccttcagcaccgaggacggccagggcaaggcccacaacggcaggctgatcaccg ccaaccccatcgtgaccgagaaggacagccccgtgaacatcgaggccgagcccccctt cggcgacagctacatcgtggtgggcgccggcgagaaggccctgaagctgcactggttc aagaagggcagcagcatcggccagatgttcgaggccaccgccaggggcgccaggagga tggccatcctgggcgacaccgcctgggacttcggcagcgtgggcggcgtgttcaccag catcggcaaggccctgcaccaggtgttcggcgccatctacggcgcc 177. 12H4 atgaccgtgttcttcatcctgatgatgctggtggccccctcctacgccatgaggtgca Round II tcggcatcagcaacagggacttcgtggagggcgtgagcggcgccacctgggtggacgt shuffled ggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcccaccctggac clone atcgagctgatcaagaccgaggccacccagcccgccaccctgaggaagtactgcatcg aggccagcatcagcaacatcaccaccgacagcaggtgccccacccagggcgagcccta cctgaaggaggagcaggaccagcagtacatctgcaggagggacgtggtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgcgccaagttca agtgcgtgaccaagctggagggcaagatcgtgcagtacgagaacctgaagtacagcgt gatcgtgaccgtgcacaccggcgaccagcaccaggtgggcaacgacacccagggcgtg accgtggagatcaccccccaggccagcaccgtggaggccatcctgcccgagtacggca ccctgggcctggagtgcagccccaggaccggcctggacttcaacgagatgatcctgct gaccatgaagaagaagagctggctggtgcacaagcagtggttcctggacctgcccctg ccctggaccgccggcgccgacaccgagacccccacctggaacaggaaggagctgctgg tgaccttcaagaacgcccacgccaagaagcaggaggtggtggtgctgggcagccagga gggcgccatgcacaccgccctgaccggcgccaccgagatccagatgagcagcggcaac ctgctgttcaccggccacctgaagtgcaggctgaagatggacaagctgcagctgaagg gcatgagctacagcatgtgcaccggcaagttccagatcgtgaaggagatcgccgagac ccagcacggcaccatcgtgatcagggtgcagtacgagggcgacggcgccccctgcaag atccccttcagcaccgaggacgagaagggcgtgacccagaacggcaggctgatcaccg ccaaccccatcgtgaccgagaaggacagccccgtgaacatcgaggccgagcccccctt cggcgacagctacatcatcatcggtgtggagcccggccagctgaagctgcactggttc aagaagggcagcagcatcggccagatgttcgaggccaccgccaggggcgccaggagga tggccatcctgggcgacaccgcctgggacttcggcagcgcctacacc 178. 13F11 atgaccgtgttcttcatcctgatgatgctggtgaccccctctatggccatgaggtgcg Round II tgggcatcggcaaccgcgacttcgtggagggcgtgagcggcggcgcctgggtggacct shuffled ggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcccaccctggac clone atcgagctgatcaagaccgaggccacccagcccgccaccctgaggaagtactgcatcg aggccaagatcaccaacatcaccaccgacagcaggtgccccacccagggcgagcccta cctgaaggaggagcaggaccagcagtacatctgcaggagggacgtggtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgcgccaagttca agtgcgtgaccaagctggagggcaagatcgtgcagtacgagaacctgaagtacagcgt gatcgtgaccgtgcacaccggcgaccagcaccaggtgggcaacgagaccaccgagcac ggcaccatcgccaccatcaccccccaggcccccaccagcgagatccagctgaccgact acggcgccctgaccctggactgcagccccaggaccggcctggacttcaacagggtggt gctgctgaccatgaagaagaagacctggctggtgcacaagcagtggttcctggacctg cccctgccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaaggaga ggatggtgaccttcaaggtgccccacgccaagaggcaggacgtgaccgtgctgggcag ccaggagggcgccatgcacagcgccctggccggcgccaccgaggtggacagcggcgac ggcaaccacatgttcgccggccacctgaagtgcaggctgaagatggacaagctgcagc tgaagggcatgagctacagcatgtgcaccggcaagttccagatcgtgaaggagatcgc cgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcgacggcgccccc tgcaagatccccttcagcaccgaggacgagaagggcaaggcccacaacggcaggctga tcaccgccaaccccatcgtgatcgacaaggagaagcccgtgaacatcgagctggagcc ccccttcggcgagagctacatcgtggtgggcgccggcgagaaggccctgaagctgagc tggttcaagaagggcagcagcatcggcaagatgttcgagagcacctacaggggcgcca agaggatggccatcctgggcgacaccgcctgggacttcggcagcgtgggcggcgtgtt caccagcatcggcaaggccctgcaccaggtgttcggcgccatctacggcgcc 179. 14B1 atgaccgtgttcttcatcctgatgatgctggtggccccctcctacggcatgaggtgcg Round II tgggcatcagcaacagggacttcgtggagggcgtgagcggcggcgcctgggtggacct shuffled ggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcccaccctggac clone atcgagctgctgaagaccgaggtgaccaaccccgccaccctgaggaagtactgcatcg aggccaagatcaccaacatcaccaccgacagcaggtgccccacccagggcgagcccta cctgaaggaggagcaggaccagcagtacatctgcaggagggacgtggtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgcgccaagttca agtgcgtgaccaagctggagggcaacatcgtgcagcccgagaacctgaagtacaccat cgtgatcaccccccacaccggcgaccagcaccaggtgggcaacgacacccagggcgtg accgtggagatcaccccccaggccagcaccgtggaggccatcctgcccgagtacggca ccctgggcctggagtgcagccccaggaccggcctggacttcaacgagatgatcctgct gaccatgaagaagaagagctggctggtgcacaagcagtggttcctggacctgcccctg ccctggaccgccggcgccgacaccgagacccccacctggaacaggaaggagctgctgg tgaccttcaagaacgcccacgccaagaggcaggacgtgaccgtgctgggcagccagga gggcgccatgcacagcgccctgaccggcgccaccgaggtggacagcggcgacggcaac ctgctgttcaccggccacctgaagtgcaggctgaagatggacaagctgcagctgaagg gcatgagctacaccatgtgcaccggcaagttccagatcgtgaaggagatcgccgagac ccagcacggcaccatcgtgatcagggtgcagtacgagggcgacggcgccccctgcaag atccccttcagcaccgaggacgagaagggcgtgacccagaacggcaggctgatcaccg ccaaccccatcgtgaccgagaaggacagccccgtgaacatcgaggccgagcccccctt cggcgagagctacatcgtggtgggcgccggcgagaaggccctgaagctgagctggttc aagaagggcagcagcatcggccagatgttcgagagcaccgccaggggcgccaggagga tggccatcctgggcgacaccgcctgggacttcggcagcgcctacacc 180. 14G10 atggccgtgttcttcatcctgctgatgctggtgaccccctctatggcaatgaggtgca Round II tcggcatcagcaacagggacttcgtggagggcgtgagcggcgccacctgggtggacgt shuffled ggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcccaccctggac atcgagctgctgaagaccgaggtgaccaaccccgccaccctgaggaagtactgcatcg aggccaagatcaccaacatcaccaccgacaccaggtgccccacccagggcgaggccat cctgcccgaggagcaggaccagcagtacatctgcaggagggacgtggtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcctggtgacctgcgccaagttca cctgcaagaagaacatggagggcaacatcgtgcagcccgagaacctggagtacaccat cgtgatcaccccccacagcggcgaggagcacgccgtgggcaacgagaccaccgagcac ggcaccatcgccaccatcaccccccaggcccccaccagcgagatccagctgaccgact acggcgccctgaccctggactgcagccccaggaccggcctggacttcaacagggtggt gctgctgaccatgaagaagaagacctggctggtgcacaagcagtggttcctggacctg cccctgccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaaggaga ggatggtgaccttcaaggtgccccacgccaagaggcaggacgtgaccgtgctgggcag ccaggagggcgccatgcacagcgccctggccggcgccaccgagatccagatgagcagc ggcaacctgctgttcaccggccacctgaagtgcaggctgaagatggacaagctgcagc tgaagggcatgagctacagcatgtgcaccggcaagttccagatcgtgaaggagatcgc cgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcgacggcgccccc tgcaagatccccttcagcaccgaggacggccagggcaaggcccacaacggcaggctga tcaccgccaaccccatcgtgatcgacaaggagaagcccgtgaacatcgagctggagcc ccccttcggcgacagctacatcgtggtgggcgccggcgacaaggccctgaagatcaac tggtacaagaagggcagcagcatcggcaagatgttcgagagcacctacaggggcgcca agaggatggccatcctgggcgagaccgcctgggacttcggcagcgtgggcggcgtgtt caccagcatcggcaaggccctgcaccaggtgttcggcgccatctacggcgcc 181. 14H2 atgaccgtgttcttcatcctgatgatgctggtggccccctcctacggcatgaggtgcg Round II tgggcgtgggcaacagggacttcgtggagggcgtgagcggcggcgcctgggtggacct shuffled ggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcccaccctggac clone ttcgagctgatcaagaccaccgccaaggaggtggccctgctgaggacctactgcatcg aggccaagatcaccaacatcaccaccgccaccaggtgccccacccagggcgagcccta cctgaaggaggagcaggaccagaactacgtgtgcaagcacacctacgtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgcgccaagttcc agtgcgtgaccaagctggagggcaacatcgtgcagcccgagaacctggagtacaccat cgtgatcaccccccacagcggcgaggagcacgccgtgggcaacgacacccagggcgtg accgtggagatcaccccccaggccagcaccgtggaggccatcctgaccgactacggcg ccctgaccctggactgcagccccaggaccggcctggacttcaacagggtggtgctgct gaccatgaagaagaagacctggctggtgcacaagcagtggttcctggacctgcccctg ccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaaggagaggatgg tgaccttcaaggtgccccacgccaagaagcaggaggtggtggtgctgggcagccagga gggcgccatgcacaccgccctgaccggcgccaccgaggtggacagcggcgacggcaac ctgctgttcaccggccacctgaagtgcaggctgaagatggacaagctgcagctgaagg gcatgagctacagcatgtgcaccggcaagttccagatcgtgaaggagatcgccgagac ccagcacggcaccatcgtgatcagggtgcagtacaagggcgaggacgccccctgcaag atccccttcagcagccaggacggccagggcaaggcccacaacggcaggctgatcaccg ccaaccccatcgtgaccgagaaggacagccccgtgaacatcgaggccgagcccccctt cggcgagagctacatcgtggtgggcgccggcgagaaggccctgaagctgagctggttc aagaagggcagcagcatcggccagatgttcgagaccaccatgaggggcgccaagagga tggccatcctgggcgagaccgcctgggacttcggcagcgtgggcggcgtgttcaccag catcggcaaggccctgcaccaggtgttcggcgccatctacggcgcc 182. 15C2 atggtggtgatcttcatcctgctgatgctggtgaccccctctatggccatgaggtgcg Round II tcggcatcagcaacagggacttcgtggagggcctgagcggcgccacctgggtggacgt shuffled ggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcccaccctggac clone atcgagctgctgaagaccgaggtgaccaaccccgccaccctgaggaagtactgcatcg aggccaagatcaccaacatcaccaccgacagcaggtgccccacccagggcgagcccta cctgaaggaggagcaggaccagcagtacatctgcaggagggacgtggtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcgtggtgacctgcgccaagttcc agtgcctggagcccatcgagggcaagatcgtgcagtacgagaacctgaagtacagcgt gatcgtgaccgtgcacaccggcgaccagcaccaggtgggcaacgacacccagggcgtg accgtggagatcaccccccaggccagcaccgtggaggccatcctgcccgagtacggca ccctgggcctggagtgcagccccaggactggcctggacttcaacgagatgatcctgct gaccatgaagaacaaggcctggatggtgcacaggcagtggttcttcgacctgcccctg ccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaaggagaggatgg tgaccttcaaggtgccccacgccaagaggcaggacgtgaccgtgctgggcagccagga gggcgccatgcacagcgccctggccggcgccaccgaggtggacagcggcgacggcaac cacatgttcgccggccacctgaagtgcaggctgaagatggacaagctgcagctgaagg gcatgagctacgtgatgtgcaccggcaagttccagatcgtgaaggagatcgccgagac ccagcacggcaccatcgtgatcagggtgcagtacgagggcgacggcgccccctgcaag atccccttcagcaccgaggacgagaagggcgtgacccagaacggcaggctgatcaccg ccaaccccatcgtgatcgacaaggagaagcccgtgaacatcgaggccgagcccccctt cggcgagagctacatcgtcatcggcgtggagcccggccagctgaagctgcactggttc aagaagggcagcagcatcggccagatgttcgaggccaccgccaggggcgccaggagga tggccatcctgggcgagaccgcctgggacttcggcagcgtgggcggcctgctgaccag cctgggcaaggccgtgcaccaggtgttcggcagcgtgtacaccaccatgttcggc 183. 15D4 atgaccgtgttcttcatcctgctgatgctggtgaccccctctatggccatgaggtgcg Round II tgggcatcggcaaccgcgacttcgtggagggcctgagcggcgccacctgggtggacgt shuffled ggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcccaccctggac clone ttcgagctgatcaagaccaccgccaaggaggtggccctgctgaggacctactgcatcg aggccaagatcaccaacatcaccaccgacagcaggtgccccacccagggcgaggccat cctgcccgaggagcaggaccagaactacgtgtgcaagcacacctacgtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgcgccaagttca agtgcctggagcccatcgagggcaaggtggtgcagcccgagaacctggagtacaccat cgtgatcaccccccacagcggcgaggagcacgccgtgggcaacgacactggcaagcac ggcaaggagatcaagatcaccccccagagcagcatcaccgaggccgagctgaccggct acggcaccgtgaccatggagtgcagccccaggaccggcctggacttcaacagggtggt gctgctgaccatgaagaagaagagctggctggtgcacaagcagtggttcctggacctg cccctgccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaaggaga ggatggtgaccttcaagaccgcccacgccaagaagcaggacgtggtggtgctgggcag ccaggagggcgccatgcacagcgccctggccggcgccaccgagatccagatgagcagc ggcaacctgctgttcaccggccacctgaagtgcaggctgaggatggacaagctgaccc tgaagggcgtgagctacgtgatgtgcaccggcagcttcaagctggagaaggaggtggc cgagacccagcacggcaccgtgctggtgcaggtgaagtacgagggcaccgacgccccc tgcaagatccccttcagcaccgaggacgagaagggcgtgacccagaacggcaggctga tcaccgccaaccccatcgtgaccgagaaggacagccccgtgaacatcgaggccgagcc ccccttcggcgagagctacatcgtggtgggcgccggcgacaaggccctgaagatcaac tggtacaagaagggcagcagcatcggcaagatgttcgagagcacctacaggggcgcca agaggatggccatcttgggcgacaccgcctgggacttcggcagcctgggcggcgtgtt caccagcatcggcaaggccctgcaccaagtgttcggcgccatctacggcgcc 184. 15H4 atgaccgtgttcttcatcctgatgatgctggtggccccctcctacggcatgaggtgcg Round II tgggcgtgggcaacagggacttcgtggagggcctgagcggcgccacctgggtggacgt shuffled ggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcccaccctggac clone ttcgagctgatcaagaccgaggtgaccaaccccgccaccctgaggaagtactgcatcg aggccagcatcagcaacatcaccaccgccaccaggtgccccacccagggcgaggccaa cctgcccgaggagcaggaccagaactacgtgtgcaagcacacctacgtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcctggtgacctgcgccaagttcc agtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctgaagtacaccgt gatcatcaccgtgcacaccggcgaccagcaccaggtgggcaacgacacccagggcgtg accgtggagatcaccccccaggccagcaccgtggaggccatcctgcccgagtacggca ccctgggcctggagtgcagccccaggaccggcctggacttcaacagggtggtgctgct gaccatgaagaagaagagctggctggtgcacaagcagtggttcctggacctgcccctg ccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaaggagaggatgg tgaccttcaagaccgcccacgccaagaagcaggaggtggtggtgctgggcagccagga gggcgccatgcacagcgccctggccggcgccaccgagatccagatgagcagcggcaac ctgctgttcaccggccgcctgaagtgcaggctgaagatggacaagctgcagctgaagg gcatgagctacgtgatgtgcaccggcaagttccagatcgtgaaggagatcgccgagac ccagcacggcaccatcgtgatcagggtgcagtacgagggcgacggcgccccctgcaag atccccctggagatcatggacctggagaagaggcacgtgctgggcaggctgatcaccg tgaaccccatcgtgatcgacaaggagaagcccgtgaacatcgaggccgagcccccctt tggcgacagctacatcatcatcggcgtggagcccggccagctgaagctgcactggttc aagaagggcagcagcatcggccagatgttgatgagcacctacaggggcgccaagagga tggccatcctgggcgacaccgcctgggacttcggcagcgtgggcggcgtgctgaacag cctgggcaagatggtgcaccagatcttcggcagcgtgtacaccaccatgttcggc 185. 16B4 atgggcatcatcttcatcctgctgatgctggtgaccccctcctacggcatgaggtgcg Round II tgggcgtgggcaacagggacttcgtggagggcctgagcggcgccacctgggtggacgt shuffled ggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcccaccctggac clone atcgagctgctgaagaccgaggtgaccaaccccgccgtgctgaggaagctgtgcatcg aggccagcatcagcaacatcaccaccgacagcaggtgccccacccagggcgaggccat cctgcccgaggagcaggaccagaactacgtgtgcaagcacacctacgtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgcgccaagttca agtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctgaagtacaccgt gatcatcaccgtgcacaccggcgaccagcaccaggtgggcaacgacacccagggcgtg accgtggagatcaccccccaggccagcaccgtggaggccatcctgcccgagtacggca ccctgggcctggagtgcagccccaggaccggcctggacttcaacgagatgatcctgct gaccatgaagaagaagacctggctggtgcacaagcagtggttcctcgacctgcccctg ccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaaggagaggatgg tgaccttcaagaccgcccacgccaagaagcaggacgtggtggtgctgggcagccagga gggcgccatgcacaccgccctgaccggcgccaccgagatccagatgagcagcggcacc accaccatcttcgccggccacctgaagtgcaggctgaagatggacaagctgcagctga agggcatgagctacaccatgtgcaccggcaagttccagatcgtgaaggagatcgccga gacccagcacggcaccatcctgatcaaggtggagtacaagggcgaggacgccccctgc aagatccccttcagcaccgaggacgagaagggcgtgacccagaacggcaggctgatca ccgccaaccccatcgtgaccgagaaggacagccccgtgaacatcgaggccgagccccc cttcggcgagagctacatcgtggtcggcgtgggcgacaaggccctgaagatcaactgg tacaagaagggcagcagcatcggccagatgttcgagaccaccatgaggggcgccaaga ggatggccatcctgggcgacaccgcctgggacttcggcagcgtgggcggcctgctgac cagcctgggcaagatggtgcaccagatcttcggcagcgtgtacaccaccatgttcggc 186. 16F12 atgaccgtgttcttcatcctgatgatgctggtggccccctcctacggcatgaggtgcg Round II tgggcgtgggcaacggggacttcgtggagggcctgagcggcgccacctgggtggacgt shuffled ggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcccaccctggac clone ttcgagctgatcaagaccatcgccaaggaggtggccctgctgaggacctactgcatcg aggccagcatcagcaacatcaccaccgacagcaggtgccccacccagggcgaggccat cctgcccgaggagcaggaccagaactacgtgtgcaagcacacctacgtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcgtggtgacctgcgccaagttcc agtgcctggagcccatcgagggcaaggtggtgcagcccgagaacctggagtacaccat cgtgatcaccccccacaccggcgaccagcaccaggtgggcaacgacacccagggcgtg accgtggagatcaccccccaggccagcaccgtggaggccatcctgcccgagtacggca ccctgggcctggagtgcagccccaggaccggcctggacttcaacgagatgatcctgct gaccatgaagaacaaggcctggatggtgcacaggcagtggttcttcgacctgcccctg ccctggaccagcggcgccaccaccgaggtgcactggaaccacaaggagaggatggtga ccttcaagaccgcccacgccaagaagcaggacgtggtggtgctgggcagccaggaggg caccatgcacaccgccctgaccggcgccaccgagatccagatgagcagcggcaccacc accatcttcgccggccacctgaagtgcaaggtgaagatggacaagctgcagctgaagg gcatgagctacaccatgtgcaccggcaagttccagatcgtgaaggagatcgccgagac ccagcacggcaccatcgtgatcagggtgcagtacgagggcgacggcgccccctgcaag atccccctggagatcatggacctggagaagaggcacgtgctgggcaggctgatcaccg tgaaccccatcgtgatcgacaaggagaagcccgtgaacatcgaggccgagcccccctt cggcgagagctacatcgtggtcggcgtgggcgacagcgccctgaagatcaactggtac aagaagggcagcagcatcggcaagatgttcgagagcacctacaggggcgccaagagga tggccatcctgggcgagaccgcctgggacttcggcagcgtgggcggcgtgttcaccag catcggcaaggccctgcaccaggtgttcggcgccatctacggcgcc 187. 16G11 atgggcatcatcttcatcctgctgatgctggtgaccccctctatggccatgaggtgcg Round II tgggcatcggcaaccgcgacttcgtggagggcctgagcggcgccacctgggtggacgt shuffled ggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcccaccctggac clone atcgagctgctgaagaccgaggtgaccaaccccgccgtgctgaggaagctgtgcatcg aggccagcatcagcaacaccaccaccgacagcaggtgccccacccagggcgaggccac cctggtggaggagcaggacaccaacttcgtgtgcaggaggaccttcgtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcgtggtgacctgcgccaagttcc agtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctgaagtacaccgt gatcatcaccgtgcacaccggcgaccagcaccaggtgggcaacgacacccagggcgtg accgtggagatcaccccccaggccagcaccgtggaggccatcctgcccgagtacggca ccctgggcctggagtgcagccccaggaccggcctggacttcaacgagatgatcctgct gaccatgaagaacaaggcctggctggtgcacaagcagtggttcctggacctgcccctg ccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaaggagaggatgg tgaccttcaagaccgcccacgccaagaagcaggacgtggtggtgctgggcagccagga gggcgccatgcacaccgccctgaccggcgccaccgagatccagatgagcagcggcaac ctgctgttcaccggccacctgaagtgcaggctgaggatggacaagctgcagctgaagg gcatgagctacgtgatgtgcaccggcaagttccagatcgtgaaggagatcgccgagac ccagcacggcaccatcgtgatcagggtgcagtacgagggcgacggcgccccctgcaag atccccttcagcaccgaggacggccagggcaaggcccacaacggcaggctgatcaccg ccaaccccatcgtgatcgacaaggagaagcccgtgaacatcgagctggagcccccctt cggcgacagctacatcgtggtgggcgccggcgacaaggccctgaagatcaactggtac aagaagggcagcagcatcggcaagatgttcgaggccaccgccaggggcgccaggagga tggccatcctgggcgacaccgcctgggacttcggcagcctgggcggcctgctgaccag cctgggcaaggccgtgcaccaggtgttcggcagcgtgtacaccaccatgttcggc 188. 17A12 atgaccgtgttcttcatcctgatgatgctggtgaccccctccatgacaatgaggtgcg Round II tgggcgtgggcaacagggacttcgtggagggcctgagcggcgccacctgggtggacgt shuffled ggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcccaccctggac clone ttcgagctgatcaagaccgaggccacccagcccgccaccctgaggaagtactgcatcg aggccaagatcaccaacatcaccaccgacagcaggtgccccacccagggcgaggccat cctgcccgaggagcaggaccagaactacgtgtgcaagcacacctacgtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgcgccaagttca agtgcctggagcccatcgagggcaaggtggtgcagcccgagaacctggagtacaccat cgtgatcaccccccacaccggcgaccagcaccaggtgggcaacgacacccagggcgtg accgtggagatcaccccccaggccagcaccgtggaggccatcctgcccgagtacggca ccctgggcctggagtgcagccccaggactggcctggacttcaacgagatgatcctgct gaccatgaagaagaagagctggctggtgcacaagcagtggttcctggacctgcccctg ccctggaccagcggcgccaccaccgagacccccacctggaacaggaaggagctgctgg tgaccttcaagaacgcccacgccaagaggcaggacgtgaccgtgctgggcagccagga gggcgccatgcacagcgccctgaccggcgccaccgagatccagaccagcggcaccacc accatcttcgccggccacctgaagtgcagggtgaggatggacaagctgcagctgaagg gcatgagctacgtgatgtgcaccggcagcttcaagctggagaaggaggtggccgagac ccagcacggcaccgtgctggtgcaggtgaagtacgagggcaccgacgccccctgcaag atccccgtcagcagccaggacgagaagggcgtgacccagaacggcaggctgatcaccg ccaaccccatcgtgaccgagaaggacagccccgtgaacatcgaggccgagcccccctt cggcgacagctacatcatcatcggcgtggagcccggccagctgaagctgcactggttc aagaagggcagcagcatcggccagatgttcgagaccaccatgaggggcgccaagagga tggccatcctgggcgacaccgcctgggacctcggcagcgtgggcggcgtgttcaccag catcggcaaggccctgcaccaggtgttcggcgccatctacggcgcc 189. 17D5 atgaccgtgttcttcatcctgctgatgctggtgaccccctccatgacaatgaggtgca Round II tcggcatcagcaacagggacttcgtggagggcgtgagcggcgccacctgggtggacgt shuffled ggtgctggagcacggcggctgcgtgaccaccatggccaggaacaagcccaccctggac clone atcgagctgatcaagaccgaggccacccagcccgccaccctgaggaagtactgcatcg aggccagcatcagcaacatcaccaccgccaccaggtgccccacccagggcgaggccat cctgcccgaggagcaggaccagcagtacatctgcaggagggacgtggtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcgtggtgacctgcgccaagttcc agtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctgaagtacaccgt gatcatcaccgtgcacaccggcgaccagcaccaggtgggcaacgacacccagggcgtg accgtggagatcaccccccaggcccccaccagcgagatccagctgaccgactacggcg ccctgaccctggactgcagccccaggaccggcctggacttcaacagggtggtgctgct gaccatgaagaagaagacctggctggtgcacaagcagtggttcctggacctgcccctg ccctggaccagcggcgccaccaccgagacccccacctggaacaggaaggagctgctgg tgaccttcaagaacgcccacgccaagaagcaggaggtggtggtgctgggcagccagga gggcgccatgcacaccgccctgaccggcgccaccgaggtggacagcggcgacggcaac ctgctgttcaccggccacctgaagtgcaggctgaagatggacaagctgcagctgaagg gcatgagctacaccatgtgcaccggcaagttccagatcgtgaaggagatcgccgagac ccagcacggcaccatcctgatcaaggtggagtacaagggcgaggacgccccctgcaag atccccttcagcaccgaggacgagaagggcgtgacccagaacggcaggctgatcaccg ccaaccccatcgtgaccgagaaggacagccccgtgaacatcgaggccgagcccccctt cggcgagagctacatcgtggtgggcgccggcgacaaggccctgaagatcaactggtac aagaagggcagcagcatcggcaagatgttcgagagcacctacaggggcgccaagagga tggccatcttgggcgacaccgcctgggacttcggcagcctgggcggcgtgttcaccag catcggcaaggccctgcaccaggtgttcggcgccatctacggcgcc 190. 17D11 atgaccgtgttcttcatcctgatgatgctggtggccccctcctacggcatgaggtgcg Round II tgggcaccggcaaccgcgacttcgtggagggcctgagcggcgccacctgggtggacgt shuffled ggtgctggagcacggcggctgcgtgaccaccatggcccagggcaagcccaccctggac clone atcgagctgatcaagaccgaggccacccagcccgccaccctgaggaagtactgcatcg aggccaagctgaccaataccaccaccgagagcaggtgccccacccagggcgagcccta cctgaaggaggagcaggaccagaactacgtgtgcaagcacacctacgtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcgtggtgacctgcgccaagttcc agtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctgaagtacaccgt gatcatcaccgtgcacaccggcgaccagcaccaggtgggcaacgacacccagggcgtg accgtggagatcaccccccaggccagcaccgtggaggccatcctgcccgagtacggca ccctgggcctggagtgcagccccaggactggcctggacttcaacgagatgatcctgct gaccatgaagaagaagagctggctggtgcacaagcagtggttcctggacctgcccctg ccctggaccgccggcgccagcaccagccaggagacctggaacaggaaggagctgctgg tgaccttcaagaacgcccacgccaagaagcaggaggtggtggtgctgggcagccagga gggcgccatgcacaccgccctgaccggcgccaccgaggtggacagcggcgacggcaac ctgctgttcaccggccacctgaagtgcaggctgaagatggacaagctgcagctgaagg gcatgagctacaccatgtgcaccggcaagttccagatcgtgaaggagatcgccgagac ccagcacggcaccatcctgatcaaggtggagtacaagggcaccgacgccccctgcaag atccccttcagcagccaggacgagaagggcgtgacccagaacggcaggctgatcaccg ccaaccccatcgtgaccgagaaggacagccccgtgaacatcgaggccgagcccccctt cggcgagagctacatcgtggtgggcgccggcgacaaggccctgaagatcaactggtac aagaagggcagcagcatcggcaagatgttcgagagcacctacaggggcgccaagagga tggccatcctgggcgagaccgcctgggacttcggcagcgtgggcggcgtgttcaccag catcggcaaggccctgcaccaggtgttcggcgccatctacggcgcc 191. 17F5 atgaccgtgttcttcatcctgatgatgctggtggccccctcctacggcatgaggtgcg Round II tgggcgtgggcaacagggacttcgtggagggcctgagcggcgccacctgggtggacgt shuffled ggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcccaccctggac clone atcgagctgctgaagaccgaggtgaccaaccccgccaccctgaggaagtactgcatcg aggccaagatcaccaacatcaccaccgacagcaggtgccccacccagggcgaggccat cctgcccgaggagcaggaccagaactacgtgtgcaagcacacctacgtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcgtggtgacctgcgccaagttcc agtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctgaagtacaccgt gatcatcaccgtgcacaccggcgaccagcaccaggtgggcaacgacacccagggcgtg accgtggagatcaccccccaggccagcaccgtggaggccatcctgcccgagtacggca ccctgggcctggagtgcagccccaggaccggcctggacttcaacagggtggtgctgct gaccatgaagaagaagagctggctggtgcacaagcagtggttcctggacctgcccctg ccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaaggagaggatgg tgaccttcaagaccgcccacgccaagaagcaggacgtggtggtgctgggcagccagga gggcgccatgcacagcgccctggccggcgccaccgagatccagatgagcagcggcaac ctgctgttcaccggccacctgaagtgcaggctgaggatggacaagctgcagctgaagg gcatgagctacgtgatgtgcaccggcaagttccagatcgtgaaggagatcgccgagac ccagcacggcaccatcgtgatcagggtgcagtacgagggcgacgacgccccctgcaag atccccttcagcaccgaggacgagaagggcgtgacccagaacggcaggctgatcaccg ccaaccccatcgtgaccgagaaggacagccccgtgaacatcgaggccgagcccccctt cggcgagagctacatcgtggtcggcgtgggcgacagcgccctgaccctgcactggttc aggaagggcagcagcatcggccagatgttcgagaccaccatgaggggcgccaagagga tggccatcttgggcgacaccgcctgggacttcggcagcctgggcggcgtgttcaccag cctgggcaaggccgtgcaccagatcttcggcagcgtgtacaccaccatgttcggc 192. 17F11 atggtggtgatcttcatcctgctgatgctggtgaccccctccatgacaatgaggtgca Round II tcggcatcagcaacagggacttcgtggagggcgtgagcggcgccacctgggtggacgt shuffled ggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcccaccctggac clone ttcgagctgatcaagaccaccgccaaggaggtggccctgctgaggacctactgcatcg aggccagcatcagcaacatcaccaccgccaccaggtgccccacccagggcgaggccat cctgcccgaggagcaggaccagaactacgtgtgcaagcacacctacgtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgcgccaagttcc agtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctgaagtacaccgt gatcatcaccgtgcacaccggcgaccagcaccaggtgggcaacgacacccagggcgtg accgtggagatcaccccccaggccagcaccgtggaggccatcctgcccgagtacggca ccctgggcctggagtgcagccccaggaccggcctggacttcaacgagatgatcctgct gaccatgaagaacaaggcctggatggtgcacaagcagtggctcctggacctgcccctg ccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaaggagaggatgg tgaccttcaaggtgccccacgccaagaagcaggacgtggtggtgctaggcagccagga gggcgccatgcacaccgccctgaccggcgccaccgagatccagatgagcagcggcaac ctgctgttcaccggccacctgaagtgcaggctgaggatggacaagctgcagctgaagg gcatgagctacagcatgtgcaccggcaagttccagatcgtgaaggagatcgccgagac ccagcacggcaccatcctgatcaaggtggagtacaagggcgaggacgccccctgcaag atccccttcagcaccgaggacgagaagggcgtgacccacaacggcaggctgatcaccg ccaaccccatcgtgatcgacaaggagaagcccgtgaacatcgagctggagcccccctt cggcgacagctacatcgtgatcggcgtgggcgacagcgccctgaccctgcactggttc aggaagggcagcagcatcggcaagatgttcgagagcacctacaggggcgccaagagga tggccatcctgggcgagaccgcctgggacttcggcagcgtgggcggcctgctgaccag cctgggcaaggccgtgcaccaggtgttcggcagcgtgtacaccaccatgttcggc 193. 17G5 atgaccgtgttcttcatcctgatgatgctggtggccccctctatggccatgaggtgcg Round II tgggcatcggcaaccgcgacttcgtggagggcctgagcggcgccacctgggtggacgt shuffled ggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcccaccctggac clone ttcgagctgatcaagaccaccgccaaggaggtggccctgctgaggacctactgcatcg aggccagcatcagcaacatcaccaccgccaccaggtgccccacccagggcgaggccat cctgaaggaggagcaggaccagaactacgtgtgcaagcacacctacgtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcgtggtgacctgcgccaagttcc agtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctgaagtacaccgt gatcatcaccgtgcacaccggcgaccagcaccaggtgggcaacgacacccagggcgtg accgtggagatcaccccccaggccagcaccgtggaggccatcctgcccgagtacggca ccctgggcctggagtgcagccccaggaccggcctggacttcaacgagatgatcctgct gaccatgaagaacaaggcctggatggtgcacaggcagtggttcttcgacctgcccctg ccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaaggagaggatgg tgaccttcaagaccgcccacgccaagaagcaggaggtggtggtgctgggcagccagga gggcgccatgcacagcgccctggccggcgccaccgaggtggacagcggcgacggcaac cacatgttcgccggccacctgaagtgcaaggtgaggatggacaagctgaccctgaagg gcgtgagctacgtgatgtgcaccggcagcttcaagctggagaaggagatcgccgagac ccagcacggcaccatcctgatcaaggtggagtacaagggcgaggacgccccctgcaag atccccttcagcaccgaggacggccagggcaaggcccacaacggcaggctgatcaccg ccaaccccatcgtgatcgacaaggagaagcccgtgaacatcgagctggagcccccctt cggcgacagctacatcgtggtgggcgccggcgacaaggccctgaagatcaactggtac aagaagggcagcagcatcggcaagatgttcgagagcacctacaggggcgdcaagagga tggccatcctgggcgagaccgcctgggacttcggcagcgtgggcggcgtgttcaccag catcggcaaggccctgcaccaggtgttcggcgccatctacggcgcc 194. 17H3 atggtggtgatcttcatcctgctgatgctggtgaccccctccatgacaatgaggtgca Round II tcggcatcagcaacagggacttcgtggagggcgtgagcggcgccacctgggtggacgt shuffled ggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcccaccctggac clone atcgagctgctgaagaccgaggtgaccaaccccgccaccctgaggaagtactgcatcg aggccaagatcaccaacatcaccaccgccaccaggtgccccacccagggcgaggccat cctgcccgaggagcaggaccagaactacgtgtgcaagcacacctacgtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcctggtgacctgcgccaagttcc agtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctgaagtacaccgt gatcatcaccgtgcacaccggcgaccagcaccaggtgggcaacgacacccagggcgtg accgtggagatcaccccccaggccagcaccgtggaggccatcctgcccgagtacggca ccctgggcctggagtgcagccccaggactggcctggacttcaacgagatgatcctgct gaccatgaagaagaagacctggatggtgcacaggcagtggttcttcgacctgcccctg ccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaaggagaggatgg tgaccttcaagaccgcccacgccaagaggcaggacgtgaccgtgctgggcagccagga gggcgccatgcacagcgccctgaccggcgccaccgaggtggacagcggcgacggcaac ctgctgttcaccggccacctgaagtgcaggctgaagatggacaagctgcagctgaagg gcatgagctacaccatgtgcaccggcaagttccagatcgtgaaggagatcgccgagac ccagcacggcaccatcctgatcaaggtggagtacaagggcgaggacgccccctgcaag atccccttcagcaccgaggacggccagggcaaggcccacaacggcaggctgatcaccg ccaaccccgcggtgaccgacaaggagaagcccgtgaacatcgagctggagcccccctt cggcgacagctacatcgtggtgggcgccggcgacaaggccctgaagctgagctggttc aagaagggcagcagcatcggccagatgttcgagagcacctacaggggcgccaagagga tggccatcctgggcgagaccgcctgggacctgggcagcatcggcggcgtgttcaccag cgtgggcaag 195. 17H10 atggtggtgatcttcatcctgctgatgctggtgaccccctccatgacaatgaggtgca Round II tcggcatcagcaacagggacttcgtggagggcgtgagcggcggcagctgggtggacgt shuffled ggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcccaccctggac clone atcgagctgctgaagaccgaggtgaccaaccccgccaccctgaggaagtactgcatcg aggccaagatcaccaacatcaccaccgacagcaggtgccccacccagggcgaggccat catgcccgaggagcaggaccagaactacgtgtgcaagcacacctacgtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcctggtgacctgcgccaagttca cctgcaagaagaacatggagggcaaggtggtgcagcacgagaacctgaagtacaccgt gatcatcaccgtgcacaccggcgaccagcaccaggtgggcaacgacacccagggcgtg accgtggagatcaccccccaggccagcaccgtggaggccatcctgcccgagtacggca ccctgggcctggagtgcagccccaggactggcctggacttcaacgagatgatcctgct gaccatgaagaacaaggcctggatggtgcacaggcagtggttcctggacctgcccctg ccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaaggagaggatgg tgaccttcaagaccgcccacgccaagaagcaggacgtggtggtgctgggcagccagga gggcgccatgcacaccgccctgaccggcgccaccgaggtggacagcggcgacggcaac ctgctgttcaccggccacctgaagtgcaggctgaagatggacaagctgcagctgaagg gcatgagctacgtgatgtgcaccggcaagttccagatcgtgaaggagatcgccgagac ccagcacggcaccatcgtgatcagggtgcagtacgagggcgacggcgccccctgcaag atccccttcagcaccgaggacggccagggcaaggcccacaacggcaggctgatcaccg ccaaccccatcgtgatcgacaaggagaagcccgtgaacatcgagctggagcccccctt cggcgacagctacatcgtggtgggcgccggcgacaaggccctgaagatcaactggtac aagaagggcagcagcatcggcaagatgttcgaggccaccgccaggggcgccaggagga tggccatcctgggcgacaccgcctgggacttcggcagcgtgggcggcgtgctgaccag cctgggcaagatggtgcaccagatcttcggcagcgtgtacaccaccatgttcggc 196. 17H12 atggtggtgatcttcatcctgctgatgctggtgaccccctccatgacaatgaggtgcg Round II tgggcgtgggcaacagggacttcgtggagggcctgagcggcgccacctgggtggacgt shuffled ggtgctggagcacggcagctgcgtgaccaccatggcccagggcaagcccaccctggac clone atcgagctgctgaagaccgaggtgaccaaccccgccaccctgaggaagtactgcatcg aggccaagctgaccaataccaccaccgagagcaggtgccccacccagggcgagcccta cctgaaggaggagcaggaccagcagtacatctgcaggagggacgtggtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgcgccaagttca agtgcgtgaccaagctggagggcaagatcgtgcagtacgagaacctgaagtacagcgt gatcgtgaccgtgcacaccggcgaccagcaccaggtgggcaacgagaccaccgagcac ggcaccatcgccaccatcaccccccaggcccccaccagcgagatccagctgaccgact acggcgccctgaccctggactgcagccccaggaccggcctggacttcaacagggtggt gctgctgaccatgaagaagaagagctggctggtgcacaagcagtggttcctggacctg cccctgccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaaggaga ggatggtgaccttcaagaccgcccacgccaagaagcaggacgtggtggtgctgggcag ccaggagggcgccatgcacaccgccctgaccggcgccaccgaggtggacagcggcgac ggcaacctgctgttcaccggccacctgaagtgcaggctgaagatggacaagctgcagc tgaagggcatgagctacagcatgtgcaccggcaagttccagatcgtgaaggagatcgc cgagacccagcacggcaccatcgtgatcaaggtggagtacaagggcgaggacgccccc tgcaagatccccttcagcaccgaggacggccagggcaaggcccacaacggcaggctga tcaccgccaaccccatcgtgatcgacaaggagaagcccgtgaacatcgaggccgagcc ccccttcggcgagagctacatcgtggtgggcgccggcgagaaggccctgaagctgagc tggttcaagaagggcagcagcatcggccagatgttcgagaccaccatgaggggcgcca agaggatggccatcctgggcgacaccgcctgggacttcggcagcgtgggcggcctgct gaccagcctgggcaagatggtgcaccagatcttcggcagcgtgtacaccaccatgttc ggc 197. 18A9 atgaccgtgttcttcatcctgctgatgctggtgaccccctctatggccatgaggtgcg Round II tgggcatcggcaaccgcgacttcgtggagggcctgagcggcgccacctgggtggacgt shuffled ggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcccaccctggac clone atcgagctgctgaagaccgaggtgaccaaccccgccaccctgaggaagtactgcatcg aggccaagatcaccaacatcaccaccgccaccaggtgccccacccagggcgaggccat cctgcccgaggagcaggaccagaactacgtgtgcaagcacagcatggtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcgtggtgacctgcgccaagttcc agtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctgaagtacaccgt gatcatcaccgtgcacaccggcgaccagcaccaggtgggcaacgacacccagggcgtg accgtggagatcaccccccaggccagcaccgtggaggccatcctgcccgagtacggca ccctgggcctggagtgcagccccaggactggcctggacttcaacgagatgatcctgct gaccatgaagaacaaggcctggatggtgcacaagcagtggttcctggacctgcccctg ccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaaggagaggatgg tgaccttcaaggtgccccacgccaagaggcaggacgtgaccgtgctgggcagccagga gggcgccatgcacagcgccctggccggcgccaccgagatccagatgagcagcggcaac ctgctgttcaccggccacctgaagtgcaggctgaggatggacaagctgaccctgaagg gcgtgagctacgtgatgtgcaccggcaagttccagatcgtgaaggagatcgccgagac ccagcacggcaccatcgtgatcagggtgcagtacgagggcaccgacgccccctgcaag atccccttcagcagccaggacgagaagggcgtgacccagaacggcaggctgatcaccg ccaaccccatcgtgatcgacaaggagaagcccgtgaacatcgagctggagcccccctt cggcgacagctacatcgtggtgggcgccggcgacaaggccctgaccctgcactggttc aggaagggcagcagcatcggccagatgttcgagaccaccatgaggggcgccaagagga tggccatcctgggcgacaccgcctgggacttcggcagcgtgggcggcctgctgaccag cctgggcaagatggtgcaccagatcttcggcagcqtgttcaccagcgtgggcaag 198. 18B7 atgaccgtgttcttcatcctgctgatgctggtgaccccctctatggcaatgaggtgca Round II tcggcatcagcaacagggacttcgtggagggcctgagcggcgccacctgggtggacgt shuffled ggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcccaccctggac clone ttcgagctgatcaagaccaccgccaaggaggtggccctgctgaggacctactgcatcg aggccagcatcagcaacatcaccaccgccaccaggtgccccacccagggcgagcccta cctgaaggaggagcaggaccagcagtacatctgcaggagggacgtggtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcgtggtgacctgcgccaagttcc agtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctgaagtacaccgt gatcatcaccgtgcacaccggcgaccagcaccaggtgggcaacgacacccagggcgtg accgtggagatcaccccccaggccagcaccgtggaggccatcctgcccgagtacggca ccctgggcctggagtgcagccccaggaccggcctggacttcaacgagatgatcctgct gaccatgaagaacaaggcctggatggtgcacaagcagtggttcctggacctgcccctg ccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaaggagaggatgg tgaccttcaagaccgcccacgccaagaagcaggacgtggtggtgctgggcagccagga gggcgccatgcacaccgccctgaccggcgccaccgaggtggacagcggcgacggcaac ctgctgttcaccggccacctgaagtgcaggctgaggatggacaagctgcagctgaagg gcatgagctacagcatgtgcaccggcaagttccagatcgtgaaggagatcgccgagac ccagcacggcaccatcctgatcaaggtggagtacaagggcgaggacgccccctgcaag atccccttcagcaccgaggacgagaagggcgtgacccagaacggcaggctgatcaccg ccaaccccatcgtgatcgacaaggagaagcccgtgaacatcgaggccgagcccccctt cggcgacagctacatcgtggtgggcgccggcgacaaggccctgaagatcaactggtac aagaagggcagcagcatcggcaagatgttcgagagcacctacaggggcgccaagagga tggccatcctgggcgagaccgcctgggacttcggcagcgtgggcggcctgctgaccag cctgggcaaggccgtgcaccaggtgttcggcagcgtgtacaccaccatgttcggc 199. 18D7 atggtggtgatcttcatcctgctgatgctggtgaccccctccatgacaatgaggtgca Round II tcggcatcagcaacagggacttcgtggagggcgtgagcggcgccacctgggtggacgt shuffled ggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcccaccctggac clone atcgagctgctgaagaccgaggtgaccaaccccgccgtgctgaggaagctgtgcatcg aggccagcatcagcaacatcaccaccgccaccaggtgccccacccagggcgaggccat cctgcccgaggagcaggaccagaactacgtgtgcaagcacacctacgtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgcgccaagttca agtgcgtgaccaagctggagggcaagatcgtgcagtacgagaacctgaagtacagcgt gatcgtgaccgtgcacaccggcgaccagcaccaggtgggcaacgacacccagggcgtg accgtggagatcaccccccaggccagcaccgtggaggccatcctgcccgagtacggca ccctgggcctggagtgcagccccaggaccggcctggacttcaacgagatgatcctgct gaccatgaagaacaaggcctggatggtgcacaggcagtggttcttcgacctgcccctg ccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaaggagaggatgg tgaccttcaagaccgcccacgccaagaagcaggacgtggtggtgctgggcagccagga gggcgccatgcacaccgccctgaccggcgccaccgagatccagatgagcagcggcacc accaccatcttcgccggccacctgaagtgcaggctgaagatggacaagctgcagctga agggcatgagctacagcatgtgcaccggcaagttccagatcgtgaaggagatcgccga gacccagcacggcaccatcctgatcaaggtgaagtacgagggcaccgacgccccctgc aagatccccttcagcagccaggacgagaagggcgtgacccagaacggcaggctgatca ccgccaaccccatcgtgaccgagaaggacagccccgtgaacatcgaggccgagccccc cttcggcgagagctacatcgtggtgggcgccggcgagaaggccctgaagctgagctgg ttcaagaagggcagcagcatcggccagatgttcgagagcacctacaggggcgccaaga ggatggccatcctgggcgagaccgcctgggacttcggcagcgtgggcggcctgctgac cagcctgggcaaggccgtgcaccaggtgttcggcagcgtgtacaccaccatgttcggc 200. 18H2 atgaccgtgttcttcatcctgatgatgctggtggccccctcctacggcatgaggtgcg Round II tgggcgtgggcaacagggacttcgtggagggcgtgagcggcggcgcctgggtggacct shuffled ggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcccaccctggac clone atcgagctgctgaagaccgaggtgaccaaccccgccaccctgaggaagtactgcatcg aggccaagatcaccaacatcaccaccgacagcaggtgccccacccagggcgaggccat cctgcccgaggagcaggaccagaactacgtgtgcaagcacacctacgtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcgtggtgacctgcgccaagttcc agtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctgaagtacaccgt gatcatcaccgtgcacaccggcgaccagcaccaggtgggcaacgacaccagcaaccac ggcgtgaccgtggagatcaccccccaggccagcaccgtggaggccatcctgcccgagt acggcaccctgggcctggagtgcagccccaggaccggcctggacttcaacagggtggt gctgctgaccatgaagaagaagagctggctggtgcacaagcagtggttcctggacctg cccctgccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaaggaga ggatggtgaccttcaaggtgccccacgccaagaagcaggacgtggtggtgctgggcag ccaggagggcgccatgcacaccgccctggccggcgccaccgagatccagatgagcagc ggcaacctgctgttcaccggccacctgaagtgcaggctgaagatggacaagctgcagc tgaagggcatgagctacagcatgtgcaccggcaagttccagatcgtgaaggagatcgc cgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcgacggcgccccc tgcaagatccccttcagcaccgaggacggccagggcaaggcccacaacggcaggctga tcaccgccaaccccgtggtgaccaagaaggaggagcccgtgaacatcgaggccgagcc ccccttcggcgagagcaacatcgtgatcggcatcggcgacaaggccctgaagatcaac tggtacaagaagggcagcagcatcggcaagatgttcgagaccaccatgaggggcgcca agaggatggccatcttgggcgacaccgcctgggacttcggcagcgtgggcggcctgct gaccagcctgggcaagatggtgcaccagatcttcggcagcgtgtacaccaccatgttc ggc 201. 5/21-D1 atgagatctgtgaccatgattctcatgctgctgcctactgctctggccttccatctga Round I caacaagaggtggcgagcctaccctgatcgtgtccaagcaagaacgcggcaagagcct shuffled gctgttcaagacttctgcaggggtgaatatgtgcactctcatcgccatggacctgggc clone gagctgtgtgaggataccatgacctacaagtgtccacggatgaccgaggccgaacccg extended acgatgtggattgctggtgcaatgcaactgatacttgggtgacctatgggacctgtag to ccagacaggcgagcataggagggataagaggtccgtcgccctggaccctcacgttggt C15/full ctgggcctggaaaccagaaccgagacatggatgagctctgagggagcttggaagcaga prM/full tccagaaggttgagacttgggctctgcgccatccaggctttacagtcattgccctgtt E cctggctcatgcaattgggacctctattactcagaagggcatcatcttcattctgctc atgctcgtgacaccatctatggccatgaggtgcgtgggcatcggcaaccgcgacttcg tggagggcctgagcggcgccacctgggtggacgtggtgctggagcacggcagctgcgt gaccaccatggccaagaacaagcccaccctggacatcgagctgctgaagaccgaggtg accaaccccgccaccctgaggaagtactgcatcgaggccaagatcaccaacatcacca ccgacagcaggtgccccacccagggcgaggccatcctgcccgaggagcaggaccagaa ctacgtgtgcaagcacacctacgtggacaggggctggggcaacggctgcggcctgttc ggcaagggcagcctggtgacctgcgccaagttcacctgcaagaagaacatggagggca acatcgtgcagcccgagaacctggagtacaccatcgtgatcaccccccacagcggcga ggagcacgccgtgggcaacgacactggcaagcacggcaaggagatcaagatcaccccc cagagcagcatcaccgaggccgagctgaccggctacggcaccgtgaccatggagtgca gccccaggaccggcctggacttcaacagggtggtgctgctgaccatgaagaagaagag ctggctggtgcacaagcagtggttcctggacctgcccctgccctggaccgccggcgcc gacaccagcgaggtgcactggaaccacaaggagaggatggtgaccttcaaggtgcccc acgccaagaggcaggacgtgaccgtgctgggcagccaggagggcgccatgcacagcgc cctggccggcgccaccgagatccagatgagcagcggcaacctgctgttcaccggccac ctgaagtgcaggctgaggatggacaagctgcagctgaagggcatgagctacgtgatgt gcaccggcaagttccagatcgtgaaggagatcgccgagacccagcacggcaccatcgt gatcagggtgcagtacgagggcgacggcagcccctgcaagatccccctggagatcatg gacctggagaagaggcacgtgctgggcaggctgatcaccgtgaaccccatcgtgatcg acaaggagaagcccgtgaacatcgaggccgagccccccttcggcgagagctacatcgt ggtcggcgtgggcgacagcgccctgaccctgcactggttcaggaagggcagcagcatc ggccagatgttcgagaccaccatgaggggcgccaagaggatggccatcctcggcgata cagcctgggattttggctcaatcggaggggtgttcaccagcgtgggtaaactgattca tcagattttcggtacagcatatggcgtgctgttttccggcgtctcttggaccatgaag atcggtattggcatcctcctcacatggctgggactgaatagcaggagcacaagcctga gcatgacctgcatcgctgttggcatggtcacactctacctgggtgtgatggttcaggc a 202. 6E12-D4 atgagatctaccatcaccctgctctgcctgatccctactgttatggccttctctctgt Round I caacaagagatggcgagcctctcatgatcgtggccaagcacgaaagagggaggcctct shuffled gctgttcaagactactgaagggatcaataagtgcactctcatcgccatggacctgggc clone gagatgtgtgaggataccgtgacctacaagtgtccactgctggtcaacaccgaacccg extended aggatatcgattgctggtgcaatctgacttctacttgggtgatgtatgggacctgtac to ccagtccggagagagaaggagggagaagaggtccgtcgccctgactcctcactctggt C15/full atgggcctggaaaccagagccgagacatggatgagctctgagggagcttggaagcacg prM/full cccagagggttgaatcttggattctgcgcaatccacgctttgcactcctggctggctt E catggcttatatgattgggcagactggtattcagaggaccgtcttcttcgttctgatg atgctcgtggcaccatcttacggcatgaggtgcgtgggcgtgggcaacagggacttcg tggagggcgtgagcggcgccacctgggtggacgtggtgctggagcacggcagctgcgt gaccaccatggccaagaacaagcccaccctggacatcgagctgctgaagaccgaggtg accaaccccgccgtgctgaggaagctgtgcatcgaggccagcatcagcaacatcacca ccgccaccaggtgccccacccagggcgaggccatcctgcccgaggagcaggaccagaa ctacgtgtgcaagcacacctacgtggacaggggctggggcaacggctgcggcctgttc ggcaagggcagcgtggtgacctgcgccaagttccagtgcctggagcccatcgagggca aggtggtgcagcacgagaacctgaagtacaccgtgatcatcaccgtgcacaccggcga ccagcaccaggtgggcaacgacacccagggcgtgaccgtggagatcaccccccaggcc agcaccgtggaggccatcctgcccgagtacggcaccctgggcctggagtgcagcccca ggaccggcctggacttcaacgagatgatcctgctgaccatgaagaacaaggcctggat ggtgcacaggcagtggttcttcgacctgcccctgccctggaccgccggcgccgacacc agcgaggtgcactggaaccacaaggagaggatggtgaccttcaagaccgcccacgcca agaagcaggacgtggtggtgctgggcagccaggagggcgccatgcacaccgccctgac cggcgccaccgagatccagatgagcagcggcaccaccaccatcttcgccggccacctg aagtgcaaggtgaggatggacaagctgaccctgaagggcgtgagctacgtgatgtgca ccggcagcttcaagctggagaaggaggtggccgagacccagcacggcaccgtgctggt gcaggtgaagtacgagggcaccgacgccccctgcaagatccccttcagcagccaggac gagaagggcgtgacccagaacggcaggctgatcaccgccaaccccatcgtgatcgaca aggagaagcccgtgaacatcgaggccgagcccccctttggcgacagctacatcatcat cggcgtggagcccggccagctgaagctgcactggttcaagaagggcagcagcatcggc cagatgttcgagaccaccatgaggggcgccaagaggatggccatcctgggcgacaccg cctgggacttcggcagcgtgggcggcctgctgaccagcctgggcaagatggtgcacca gatcttcggcagcgtgtacaccacaatgtttggcggcgtctcttggatggtgagaatc ctgatcgggttcctcgtcctgtggattggaaccaatagcaggaatacaagcatggcca tgagctgcatcgctgttggcggcatcacactcttcctgggtttcaccgttcacgca 203. 2/7-D1 atgagatctgtgaccatgattctcatgctgctgcctactgctctggccttccatctga Round I caacaagaggtggcgagcctaccctgatcgtgtccaagcaagaacgcggcaagagcct shuffled gctgttcaagacttctgcaggggtgaatatgtgcactctcatcgccatggacctgggc clone gagctgtgtgaggataccatgacctacaagtgtccacggatgaccgaggccgaacccg extended acgatgtggattgctggtgcaatgcaactgatacttgggtgacctatgggacctgtag with DEN- ccagacaggcgagcataggagggataagaggtccgtcgccctggaccctcacgttggt 1 to ctgggcctggaaaccagaaccgagacatggatgagctctgagggagcttggaagcaga C15/full tccagaaggttgagacttgggctctgcgccatccaggctttacagtcattgccctgtt prM/full cctggctcatgcaattgggacctctattactcagaagggcatcatcttcattctgctc E atgctcgtgacaccatctatggccatgaggtgcgtgggcatcggcaaccgcgacttcg tggagggcctgagcggcgccacctgggtggacgtggtgctggagcacggcagctgcgt gaccaccatggccaagaacaagcccaccctggacatcgagctgatcaagaccgaggcc acccagcccgccaccctgaggaagtactgcatcgaggccaagctgaccaataccacca ccgagagcaggtgccccacccagggcgagccctacctgaaggaggagcaggaccagaa ctacgtgtgcaagcacacctacgtggacaggggctggggcaacggctgcggcctgttc ggcaagggcagcctgatcacctgcgccaagttcaagtgcgtgaccaagctggagggca agatcgtgcagtacgagaacctgaagtacagcgtgatcgtgaccgtgcacaccggcga ccagcaccaggtgggcaacgagaccaccgagcacggcaccatcgccaccatcaccccc caggcccccaccagcgagatccagctgaccgactacggcgccctgaccctggactgca gccccaggaccggcctggacttcaacagggtggtgctgctgaccatgaagaagaagac ctggctggtgcacaagcagtggttcctggacctgcccctgccctggaccgccggcgcc agcaccagccaggagacctggaacaggaaggagctgctggtgaccttcaagaacgccc acgccaagaagcaggaggtggtggtgctgggcagccaggagggcgccatgcacaccgc cctgaccggcgccaccgaggtggacagcggcgacggcaacctgctgttcaccggccac ctgaagtgcaggctgaagatggacaagctgcagctgaagggcatgagctacagcatgt gcaccggcaagttccagatcgtgaaggagatcgccgagacccagcacggcaccatcgt gatcagggtgcagtacgagggcgacggcgccccctgcaagatccccttcagcaccgag gacggccagggcaaggcccacaacggcaggctgatcaccgccaaccccatcgtgatcg acaaggagaagcccgtgaacatcgagctggagccccccttcggcgacagctacatcgt ggtgggcgccggcgacaaggccctgaagatcaactggtacaagaagggcagcagcatc ggcaagatgttcgagagcacctacaggggcgccaagaggatggccatcctcggcgata cagcctgggattttggctcaatcggaggggtgttcaccagcgtgggtaaactgattca tcagattttcggtacagcatatggcgtgctgttttccggcgtctcttggaccatgaag atcggtattggcatcctcctcacatggctgggactgaatagcaggagcacaagcctga gcatgacctgcatcgctgttggcatggtcacactctacctgggtgtgatggttcaggc a 204. 2G11-D4 atgagatctaccatcaccctgctctgcctgatccctactgttatggccttctctctgt Round I caacaagagatggcgagcctctcatgatcgtggccaagcacgaaagagggaggcctct shuffled gctgttcaagactactgaagggatcaataagtgcactctcatcgccatggacctgggc clone gagatgtgtgaggataccgtgacctacaagtgtccactgctggtcaacaccgaacccg extended aggatatcgattgctggtgcaatctgacttctacttgggtgatgtatgggacctgtac to ccagtccggagagagaaggagggagaagaggtccgtcgccctgactcctcactctggt C15/full atgggcctggaaaccagagccgagacatggatgagctctgagggagcttggaagcacg prM/full cccagagggttgaatcttggattctgcgcaatccacgctttgcactcctggctggctt E catggcttatatgattgggcagactggtattcagaggaccgtcttcttcgttctgatg atgctcgtggcaccatcttacggcatgaggtgcgtgggcgtgggcaacagggacttcg tggagggcgtgagcggcgccacctgggtggacgtggtgctggagcacggcggctgcgt gaccaccatggccaagaacaagcccaccctggacttcgagctgatcaagaccaccgcc aaggaggtggccctgctgaggacctactgcatcgaggccagcatcagcaacatcacca ccgccaccaggtgccccacccagggcgagccctacctgaaggaggagcaggaccagca gtacatctgcaggagggacgtggtggacaggggctggggcaacggctgcggcctgttc ggcaagggcagcctgatcacctgcgccaagttcaagtgcgtgaccaagctggagggca acatcgtgcagcccgagaacctggagtacaccatcgtgatcaccccccacaccggcga ccagcaccaggtgggcaacgacacccagggcgtgaccgtggagatcaccccccaggcc agcaccgtggaggccatcctgcccgagtacggcaccctgggcctggagtgcagcccca ggactggcctggacttcaacgagatgatcctgctgaccatgaagaacaaggcctggat ggtgcacaggcagtggttcttcgacctgcccctgccctggaccagcggcgccaccacc gagacccccacctggaacaggaaggagctgctggtgaccttcaagaacgcccacgcca agaagcaggaggtggtggtgctgggcagccaggagggcgccatgcacagcgccctggc cggcgccaccgagatccagatgagcagcggcaacctgctgttcaccggccacctgaag tgcaggctgaagatggacaagctgcagctgaagggcatgagctacaccatgtgcaccg gcaagttccagatcgtgaaggagatcgccgagacccagcacggcaccatcctgatcaa ggtggagtacaagggcgaggacgccccctgcaagatccccttcagcaccgaggacgag aagggcgtgacccagaacggcaggctgatcaccgccaaccccatcgtgaccgagaagg acagccccgtgaacatcgaggccgagccccccttcggcgagagctacatcgtggtggg cgccggcgagaaggccctgaagctgagctggttcaagaagggcagcagcatcggccag atgttcgagagcacctacaggggcgccaagaggatggccatcctgggcgagaccgcct gggacttcggcagcgtgggcggcctgctgaccagcctgggcaaggccgtgcaccaggt gttcggcagcgtgtacaccacaatgtttggcggcgtctcttggatggtgagaatcctg atcgggttcctcgtcctgtggattggaaccaatagcaggaatacaagcatggccatga gctgcatcgctgttggcggcatcacactcttcctgggtttcaccgttcacgca 205. 21C1 atgaacaggaggaggagaactgtgggcgtgattatcatgctgatccctactgctatgg Shuffled ccttccatctgacaacaagaaatggcgagcctcacatgatcgtgggcaggcaagagaa clone gggcaagagcctgctgttcaagactgaggacggcgtgaatatgtgcactctcatggcc (C15/full atcgacctgggcgagctgtgtgaggataccgtgacctacaagtgtccactgctggtca prM/full acaccgaacccgaggatatcgattgctggtgcaatctgacttctgcttgggtgaccta E) tgggacctgtaaccaggctggcgagcataggagggataagaggtccgtcgccctggct cctcacgttggtatgggcctggaaaccagaaccgagacatggatgagctctgagggag cttggaagcacgcccagaggatcgagacctggattctgcgccatccaggctttatcat catggctgccatcctggcttatacaattgggaccactcacttccagagagccctcatc ttcattctgctcaccgccgtggcaccatctatgaccatgaggtgcatcggcatcagca acagggacttcgtggagggcgtgagcggcggcgcctgggtggacctggtgctggagca cggcggctgcgtgaccaccatggcccagggcaagcccaccctggacttcgagctgatc aagaccaccgccaaggaggtggccctgctgaggacctactgcatcgaggccagcatca gcaacatcaccaccgccaccaggtgccccacccagggcgagccctacctgaaggagga gcaggaccagcagtacatctgcaggagggacgtggtggacaggggctggggcaacggc tgcggcctgttcggcaagggcggcgtggtgacctgcgccaagttcaagtgcgtgacca agctggagggcaagatcgtgcagtacgagaacctggagtacaccgtggtggtgaccgt gcacaacggcgacacccacgccgtgggcaacgacaccagcaaccacggcgtgaccgcc accatcacccccaggagccccagcgtggaggtgaagctgcccgactacggcgagctga ccctggactgcgagcccaggagcggcatcgacttcaacgagatgatcctgatgaagat gaagaagaagacctggctggtgcacaagcagtggttcctggacctgcccctgccctgg accgccggcgccgacaccagcgaggtgcactggaaccacaaggagaggatggtgacct tcaaggtgccccacgccaagaggcaggacgtgaccgtgctgggcagccaggagggcgc catgcacagcgccctggccggcgccaccgaggtggacagcggcgacggcaaccacatg ttcgccggccacctgaagtgcaaggtgaggatggacaagctgcagctgaagggcatga gctacagcatgtgcaccggcaagttccagctcgtgaaggagatcgccgagacccagca cggcaccatcgtgatcagggtgcagtacgagggcgacggcagcccctgcaagatcccc ctggagatcatggacctggagaagaggcacgtgctgggcaggctgatcaccgtgaacc ccatcgtgatcgacaaggagaagcccgtgaacatcgaggccgagccccccttcggcga cagctacatcatcatcggcgtgggcgacagcgccctgaccctgcactggttcaggaag ggcagcagcatcggcaagatgttcgagagcacctacaggggcgccaggaggatggcca tcctgggcgagaccgcctgggacttcggcagcgtgggcggcctgctgaccagcctggg caaggccgtgcaccaggtgttcggcagcgtgtacaccacaatgtttggcggcgtctct tggatggtgagaatcctgatcgggttcctcgtcctgtggattggaaccaatagcagga atacaagcatggccatgagctgcatcgctgttggcggcatcacactcttcctgggtgc catggttcaggca 206. 23C12 atgagatctgtgaccatgattctcatgctgctgcctactgctctggccttccatctga Shuffled caacaagaggtggcgagcctaccctgatcgtgtccaagcaagaacgcggcaagagcct clone gctgttcaagactgcttcagggatcaatatgtgcactctcatcgccatggacctgggc (C15/full gagatgtgtgacgataccgtgacctacaagtgtccacacatcaccgaggtcgaacccg prM/Full gatatcgattgctggtgcaatctgacttctacttgggtgacctatgggacctgtaa E) ccaggctggcgagcataggagggataagaggtccgtcgccctggttcctcacgttggt atgggcctggaaaccagaaccgagacatggatgagctctgagggagcttggaagcacg cccagaggatcgagacctggattctgcgccatccaggctttatcatcatggctgccat cctggcttatacaattgggaccactcacttccagagagccctcatcttcattctgctc atgctcgtgacaccatctatgaccatgaggtgcgtgggcgtgggcaacagggacttcg tggagggcctgagcggcgccacctgggtggacgtggtgctggagcacggcggctgcgt gaccaccatggccaagaacaagcccaccctggacatcgagctgcagaagaccgaggcc acccagctggccaccctgaggaagctgtgcatcgagggcaagatcaccaacatcacca ccgacagcaggtgccccacccagggcgaggccatcctgcccgaggagcaggaccagaa ctacgtgtgcaagcacacctacgtggacaggggctggggcaacggctgcggcctgttc ggcaagggcagcctggtgacctgcgccaagttccagtgcctggagcccatcgagggca aggtggtgcagcacgagaacctgaagtacaccgtgatcatcaccgtgcacaccggcga ccagcaccaggtgggcaacgacacccagggcgtgaccgtggagatcaccccccaggcc agcaccgtggaggccatcctgcccgagtacggcaccctgggcctggagtgcagcccca ggaccggcctggacttcaacgagatgatcctgctgaccatgaagaacaaggcctggat ggtgcacaggcagtggttcttcgacctgcccctgccctggaccagcggcgccaccacc gagacccccacctggaacaggaaggagctgctggtgaccttcaagaacgcccacgcca agaagcaggaggtggtggtgctgggcagccaggagggcgccatgcacaccgccctgac cggcaccaccgagatccagaacagcggcggcaccagcatcttcgccggccacctgaag tgcaggctgaagatggacaagctggagctgaagggcatgagctacagcatgtgcaccg gcaagttccagatcgtgaaggagatcgccgagacccagcacggcaccatcgtgatcag ggtgcagtacgagggcgacggcagcccctgcaagatccccctggagatcatggacctg gagaagaggcacgtgctgggcaggctgatcaccgtgaaccccatcgtgaccgagaagg acagccccgtgaacatcgaggccgagccccccttcggcgacagctacatcatcatcgg cgtggagcccggccagctgaagctgcactggttcaggaagggcagcagcatcggccag atgttcgagaccaccatgaggggcgccaagaggatggccatcctgggcgagaccgcct gggacttcggcagcgtgggcggcctgctgaccagcctgggcaaggccgtgcaccaggt gttcggcagcgtgtacaccacaatgtttggcggcgtctcttggatggtgagaatcctg atcgggttcctcgtcctgtggattggaaccaatagcaggaatacaagcatggccatga gctgcatcgctgttggcggcatcacactcttcctgggtttcaccgttcacgca 207. 23D5 atgaaaactagcctctgcctgatgatgatgctgcctgctactctggccttccatctga Shuffled catcaagagatggcgagcctaggatgatcgtgggcaagaacgaaagagggaagagcct clone gctgttcaagactgaggacggcgtgaatatgtgcactctcatcgccatggacctgggc (C15/full gagctgtgtgaggataccatgacctacaagtgtccacggatgaccgaggtcgaacccg prM/full aggatatcgattgctggtgcaatctgacttctacttgggtgacctatgggacctgtaa E) ccaggctggcgagcataggagggataagaggtccgtcgccctggctcctcacgttggt atgggcctggacaccagaacccagacatggatgagcgctgagggagcttggaggcagg tcgagaaggttgaaacttgggctctgcgccatccaggctttacaatcctggccctgtt cctggctcattacattgggacctctattactcagaagggcatcatcttcattctgctc atgctcgtgacaccatctatggccatgaggtgcgtgggcatcggcaaccgcgacttcg tggagggcctgagcggcgccacctgggtggacgtggtgctggagcacggcagctgcgt gaccaccatggccaagaacaagcccaccctggacatcgagctgctgaagaccgaggtg accaaccccgccgtgctgaggaagctgtgcatcgaggccaagatcagcaacaccacca ccgacagcaggtgccccacccagggcgaggccaccctggtggaggagcaggacaccaa cttcgtgtgcaggaggaccttcgtggacaggggctggggcaacggctgcggcctgttc ggcaagggcagcctgatcacctgcgccaagttcaagtgcgtgaccaagctggagggca agatcgtgcagtacgagaacctgaagtacagcgtgatcgtgaccgtgcacaacggcga cacccacgccgtgggcaacgacacccagggcgtgaccgtggagatcaccccccaggcc cccaccagcgagatccagctgaccgactacggcgccctgaccctggactgcagcccca ggaccggcctggacttcaacagggtggtgctgctgaccatgaagaagaagagctggct ggtgcacaagcagtggttcctggacctgcccctgccctggaccagcggcgccagcacc agccaggagacctggaacaggcaggacctgctggtgaccttcaagaccgcccacgcca agaagcaggaggtggtggtgctgggcagccaggagggcgccatgcacaccgccctgac cggcaccaccgagatccagaccagcggcaccaccaccatcttcgccggccacctgaag tgcaggctgaagatggacaagctgaccctgaagggcatgagctacgtgatgtgcaccg gcagcttcaagctggagaaggaggtggccgagacccagcacggcaccgtgctggtgca ggtgaagtacgagggcaccggcgccccctgcaaggtgcccatcgagatcagggacgtg aacaaggagaaggtggtgggcaggatcatcagcagcacccccctggccgagaacacca acagcgtgaccaacatcgagctggagccccccttcggcgacagctacatcgtgatcgg cgtgggcgacagcgccctgaccctgcactggttcaggaagggcagcagcatcggcaag atgttcgagagcaccgccaggggcgccaggaggatggccatcctcggcgatacagcct gggattttggctcaatcggaggggtgttcaccagcgtgggtaaactgattcatcagat tttcggtacagcatatggcgtgctgttttccggcgtctcttggaccatgaagatcggt attggcatcctcctcacatggctgggactgaatagcaggagcacaagcctgagcatga cctgcatcgctgttggcatggtcacactctacctgggtgtggtggttcaggca 208. 23F5 atgagatctgccatcaccctgctctgcctgatccctactgttatggccttctctctgt Shuffled caacaagagatggcgagcctctcatgatcgtggccaagcacgaaagagggaggcctct clone gctgttcaagactactgaagggatcaataagtgcactctcatcgccatggacctgggc (C15/full gagatgtgtgaggataccgtgacctacaagtgtccactgctggtcaacaccgaacccg prM/full aggatatcgattgctggtgcaatctgacttctacttgggtgatgtatgggacctgtac E) ccagtccggagagagaaggagggagaagaggtccgtcgccctgactcctcactctggt atgggcctggacaccagaacccagacatggatgagcgctgagggagcttggaagcacg cccagagggttgaatcttggattctgcgcaatccacgctttgcactcctggctggctt catggcttatatgattgggcagactggtattcagaggaccgtcttcttcgttctgatg atgctcgtggcaccatctatggccatgaggtgcgtgggcatcggcaaccgcgacttcg tggagggcctgagcggcggcgcctgggtggacctggtgctggagcacggcggctgcgt gaccaccatggccaagaacaagcccaccctggacttcgagctgatcaagaccgaggcc acccagcccgccaccctgaggaagtactgcatcgaggccaagctgaccaataccacca ccgagagcaggtgccccacccagggcgaggccatcctgcccgaggagcaggaccagaa ctacgtgtgcaagcacacctacgtggacaggggctggggcaacggctgcggcctgttc ggcaagggcagcctggtgacctgcgccaagttccagtgcctggagcccatcgagggca aggtggtgcagcacgagaacctgaagtacaccgtgatcatcaccgtgcacaccggcga ccagcaccaggtgggcaacgacacccagggcgtgaccgtggagatcaccccccaggcc agcaccgtggaggccatcctgcccgagtacggcaccctgggcctggagtgcagcccca ggaccggcctggacttcaacgagatgatcctgctgaccatgaagaacaagacctggct ggtgcacaagcagtggttcctggacctgcccctgccctggaccgccggcgccgacacc cgcgaggtgcactggaaccacaaggagaggatggtgaccttcaaggtgccccacgcca agaggcaggacgtgaccgtgctgggcagccaggagggcgccatgcacagcgccctggc cggcgccaccgaggtggacagcggcgacggcaaccacatgttcgccggccacctgaag tgcaggctgaagatggacaagctggagctgaagggcatgagctacagcatgtgcaccg gcaagttccggatcgtgaaggagatcgccgagacccagcacggcaccatcgtgatcag ggtgcagtacgagggcgacggcagcccctgcaagatccccctggagatcatggacctg gagaagaggcacgtgctgggcaggctgatcaccgtgaaccccatcgtgatcgacaagg agaagcccgtgaacatcgaggccgagccccccttcggcgagagctacatcgtggtggg cgccggcgagaaggccctgaagctgagctggttcaagaagggcagcagcatcggcaag atgttcgaggccaccgccaggggcgccaggaggatggccatcctcggcgatacagcct gggattttggctcaatcggaggggtgttcaccagcgtgggtaaactgattcatcagat tttcggtacagcatatggcgtgctgttttccggcgtctcttggaccatgaagatcggt attggcatcctcctcacatggctgggactgaatagcaggagcacaagcctgagcatga cctgcatcgctgttggcatggtcacactctacctgggtgtgatggttcaggca 209. 23G3 atgaaaactagcctctgcctgatgatgatgctgcctgctactctggccttccatctga Shuffled caacaagaaatggcgagcctcacatgatcgtgggcaggcaagagaagggcaagagcct clone gctgttcaagacttctgcaggggtgaatatgtgcactctcatcgccatggacctgggc C15/full gagctgtgtgaggataccatgacctacaagtgtccacggatgaccgaggccgaacccg prM/full aggatatcgattgctggtgcaatctgacttctacttgggtgacctatgggacctgtac E) ccagtccggagagagaaggagggagaagaggtccgtcgccctgactcctcactctggt atgggcctggaaaccagagccgagacatggatgagctctgagggagcttggaagcaga tccagaaggttgagacttgggctctgcgccatccaggctttacagtcattgccctgtt cctggctcatgcaattgggacctctattactcagaagggcatcatcttcattctgctc atgctcgtgacaccatctatggccatgaggtgcgtgggcatcggcaaccgcgacttcg tggagggcctgagcggcgccacctgggtggacgtggtgctggagcacggcggctgcgt gaccaccatggccaagaacaagcccaccctggacatcgagctgcagaagaccgaggcc acccagctggccaccctgaggaagctgtgcatcgagggcaagatcaccaacatcacca ccgacagcaggtgccccacccagggcgaggccatcctgcccgaggagcaggaccagaa ctacgtgtgcaagcacacctacgtggacaggggctggggcaacggctgcggcctgttc ggcaagggcagcctggtgacctgcgccaagttccagtgcctggagcccatcgagggca aggtggtgcagcacgagaacctgaagtacaccgtgatcatcaccgtgcacaccggcga ccagcaccaggtgggcaacgacacccagggcgtgaccgtggagatcaccccccaggcc agcaccgtggaggccatcctgcccgagtacggcaccctgggcctggagtgcagcccca ggaccggcctggacttcaacgagatgatcctgctgaccatgaagaacaaggcctggat ggtgcacaggcagtggttcttcgacctgcccctgccctggaccagcggcgccaccacc gagacccccacctggaacaggaaggagctgctggtgaccttcaagaacgcccacgcca agaagcaggaggtggtggtgctgggcagccaggagggcgccatgcacaccgccctgac cggcgccaccgagatccagaccagcggcaccaccaccatcttcgccggccacctgaag tgcaggctgaagatggacaagctgaccctgaagggcgtgagctacgtgatgtgcaccg gcagcttcaagctggagaaggaggtggccgagacccagcacggcaccgtgctggtgca ggtgaagtacgagggcaccgacgccccctgcaagatccccttcagcagccaggacgag aagggcgtgacccagaacggcaggctgatcaccgccaaccccatcgtgatcgacaagg agaagcccgtgaacatcgaggccgagccccccttcggcgagagcaacatcgtgatcgg catcggcgacaaggccctgaagatcaactggttcaagaagggcagcagcatcggcaag atgttcgaggccaccgccaggggcgccaggaggatggccatcctcggcgatacagcct gggattttggctcaatcggaggggtgttcaccagcatcggcaaggccctgcaccaggt tttcggtgcaatctatggcgcagccttttccggcgtctcttggaccatgaagatcctg atcggggtcatcatcacatggattggaatgaatagcaggagcacaagcctgagcgtta gcctcgtcctggttggcgtcatcacactctacctgggtgccatggttcaggca 210. 23H7 atgaaaactagcctctgcctgatgatgatgctgcctgctactctggccttccatctga Shuffled catcaagagatggcgagcctctcatgatcgtggccaagcacgaaagagggaggcctct clone gctgttcaagactactgaagggatcaataagtgcactctcatcgccatggacctgggc (C15/full gagatgtgtgaggataccgtgacctacaagtgtccactgctgaggcagaacgaacccg prM/full acgatgtggattgctggtgcaatgcaactgatacttgggtgacctatgggacctgtag E) ccagacaggcgagcataggagggataagaggtccgtcgccctggaccctcacgttggt ctgggcctggaaaccagaaccgagacatggatgagctctgagggagcttggaagcaga tccagaaggttgagacttgggctctgcgccatccaggctttacagtcattgccctgtt cctggctcatgcaattgggacctctattactcagaagggcatcatcttcattctgctc atgctcgtgacaccatctatgaccatgaggtgcgtgggcatcggcaaccgcgacttcg tggagggcctgagcggcgccacctgggtggacgtggtgctggagcacggcagctgcgt gaccaccatggccaagaacaagcccaccctggacatcgagctgctgaagaccgaggtg accaaccccgccgtgctgaggaagctgtgcatcgaggccaagatcagcaacaccacca ccgacagcaggtgccccacccagggcgaggccaccctggtggaggagcaggacaccaa cttcgtgtgcaggaggaccttcgtggacaggggctggggcaacggctgcggcctgttc ggcaagggcagcctgatcacctgcgccaagttcaagtgcctggagcccatcgagggca aggtggtgcagcacgagaacctgaagtacaccgtgatcatcaccgtgcacaccggcga ccagcaccaggtgggcaacgacacccagggcgtgaccgtggagatcaccccccaggcc agcaccgtggaggccatcctgcccgagtacggcaccctgggcctggagtgcagcccca ggaccggcctggacttcaacagggtggtgctgctgaccatgaagaagaagagctggct ggtgcacaagcagtggttcctggacctgcccctgccctggaccagcggcgccagcacc agccaggagacctggaacaggcaggacctgctggtgaccttcaagaccgcccacgcca aagagcaggaggtggtggtgctgggcagccaggagggcgccatgcacaccgccctgac cggcgccaccgagatccagaccagcggcaccaccaccatcttcaccggccacctgaag tgcaggctgaagatggacaagctgaccctgaagggcgtgagctacgtgacgtgcaccg gcagcttcaagctggagaaggaggtggccgagacccagcacggcaccgtgctggtgca ggtgaagtacgagggcaccgacgccccctgcaagatccccttcagcagccaggacgag aagggcgtgacccagaacggcaggctgatcaccgccaaccccgtggtgaccaagaagg aggagcccgtgaacatcgaggccgagccccccttcggcgagagcaacatcgtgatcgg catcggcgacaaggccctgaagatcaactggtacaagaagggcagcagcatcggcaag atgttcgaggccaccgccaggggcgccaggaggatggccatcctcggcgatacagcct gggattttggctcaatcggaggggtgttcaccagcgtgggtaaactgattcatcagat tttcggtacagcatatggcgtgctgttttccggcgtctcttggaccatgaagatcggt attggcatcctcctcacatggctgggactgaatagcaggagcacaagcctgagcatga cctgcatcgctgttggcatggtcacactctacctgggtgccatggttcaggca 211. DEN-1 atgggcatcatcttcatcctgctgatgctggtgaccccctctatggccatgaggtgcg PRM15/tE tgggcatcggcaaccgcgacttcgtggagggcctgagcggcgccacctgggtggacgt CO parent ggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcccaccctggac PRM15/ atcgagctgctgaagaccgaggtgaccaaccccgccgtgctgaggaagctgtgcatcg truncated aggccaagatcagcaacaccaccaccgacagcaggtgccccacccagggcgaggccac envelope cctggtggaggagcaggacaccaacttcgtgtgcaggaggaccttcgtggacaggggc codon tggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgcgccaagttca optimized agtgcgtgaccaagctggagggcaagatcgtgcagtacgagaacctgaagtacagcgt parent) gatcgtgaccgtgcacaccggcgaccagcaccaggtgggcaacgagaccaccgagcac ggcaccatcgccaccatcaccccccaggcccccaccagcgagatccagctgaccgact acggcgccctgaccctggactgcagccccaggaccggcctggacttcaacagggtggt gctgctgaccatgaagaagaagagctggctggtgcacaagcagtggttcctggacctg cccctgccctggaccagcggcgccagcaccagccaggagacctggaacaggcaggacc tgctggtgaccttcaagaccgcccacgccaagaagcaggaggtggtggtgctgggcag ccaggagggcgccatgcacaccgccctgaccggcgccaccgagatccagaccagcggc accaccaccatcttcgccggccacctgaagtgcaggctgaagatggacaagctgaccc tgaagggcgtgagctacgtgatgtgcaccggcagcttcaagctggagaaggaggtggc cgagacccagcacggcaccgtgctggtgcaggtgaagtacgagggcaccgacgccccc tgcaagatccccttcagcagccaggacgagaagggcgtgacccagaacggcaggctga tcaccgccaaccccatcgtgatcgacaaggagaagcccgtgaacatcgaggccgagcc ccccttcggcgagagctacatcgtggtgggcgccggcgagaaggccctgaagctgagc tggttcaagaagggcagcagcatcggcaagatgttcgaggccaccgccaggggcgcca ggaggatggccatcctgggcgacaccgcctgggacctgggcagcatcggcggcgtgtt caccagcgtgggcaag 212. DEN-2 atgggcctgatcctgatcctgcagaccgccgtggccccctccatgacaatgaggtgca PRIM15/tE tcggcatcagcaacagggacttcgtggagggcgtgagcggcggcagctgggtggacat CO parent cgtgctggagcacggctcctgcgtgaccaccatggccaagaacaagcccaccctggac ttcgagctgatcaagaccgaggccacccagcccgccaccctgaggaagtactgcatcg aggccaagctgaccaataccaccaccgagagcaggtgccccacccagggcgagcccag cctgaacgaggagcaggacaagaggttcgtgtgcaagcacagcatggtggacaggggc tggggcaacggctgcggcctgttcggcaagggcggcatcgtgacctgcgccatgttca cctgcaagaagaacatggagggcaacatcgtgcagcccgagaacctggagtacaccat cgtgatcaccccccacagcggcgaggagcacgccgtgggcaacgacactggcaagcac ggcaaggagatcaagatcaccccccagagcagcatcaccgaggccgagctgaccggct acggcaccgtgaccatggagtgcagccccaggaccggcctggacttcaacgagatcgt gctgctgcagatggaggacaaggcctggctggtgcacaggcagtggttcctggacctg cccctgccctggctgcccggcgccgacacccagggcagcaaccgtatccagaaggaga ccctggtgaccttcaagaacccccacgccaagaagcaggacgtggtggtgctgggcag ccaggagggcgccatgcacaccgccctgaccggcgccaccgagatccagatgagcagc ggcaacctgctgttcaccggccacctgaagtgcaggctgaggatggacaagctgcagc tgaagggcatgagctacagcatgtgcaccggcaagttccagatcgtgaaggagatcgc cgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcgacggcagcccc tgcaagatccccctggagatcatggacctggagaagaggcacgtgctgggcaggctga tcaccgtgaaccccatcgtgaccgagaaggacagccccgtgaacatcgaggccgagcc ccccttcggcgacagctacatcatcatcggcgtggagcccggccagctgaagctgcac tggttcaagaagggcagcagcatcggccagatgttcgagaccaccatgaggggcgcca agaggatggccatcctgggcgacaccgcctgggacttcggcagcctgggcggcgtgtt caccagcatcggcaaggccctgcaccaggtgttcggcgccatctacggcgcc 213. DEN-3 atggtggtgatcttcatcctgctgatgctggtgaccccctccatgacaatgaggtgcg PRM15/tE tgggcgtgggcaacagggacttcgtggagggcctgagcggcgccacctgggtggacgt CO parent ggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcccaccctggac atcgagctgcagaagaccgaggccacccagctggccaccctgaggaagctgtgcatcg agggcaagatcaccaacatcaccaccgacagcaggtgccccacccagggcgaggccat cctgcccgaggagcaggaccagaactacgtgtgcaagcacacctacgtggacaggggc tggggcaacggctgcggcctgttcggcaagggcagcctggtgacctgcgccaagttcc agtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctgaagtacaccgt gatcatcaccgtgcacaccggcgaccagcaccaggtgggcaacgacacccagggcgtg accgtggagatcaccccccaggccagcaccgtggaggccatcctgcccgagtacggca ccctgggcctggagtgcagccccaggaccggcctggacttcaacgagatgatcctgct gaccatgaagaacaaggcctggatggtgcacaggcagtggttcttcgacctgcccctg ccctggaccagcggcgccaccaccgagacccccacctggaacaggaaggagctgctgg tgaccttcaagaacgcccacgccaagaagcaggaggtggtggtgctgggcagccagga gggcgccatgcacaccgccctgaccggcaccaccgagatccagaacagcggcggcacc agcatcttcgccggccacctgaagtgcaggctgaagatggacaagctggagctgaagg gcatgagctacgccatgtgcctgaacaccttcgtgctgaagaaggaggtgagcgagac ccagcacggcaccatcctgatcaaggtggagtacaagggcgaggacgccccctgcaag atccccttcagcaccgaggacggccagggcaaggcccacaacggcaggctgatcaccg ccaaccccgtggtgaccaagaaggaggagcccgtgaacatcgaggccgagcccccctt cggcgagagcaacatcgtgatcggcatcggcgacaaggccctgaagatcaactggtac aagaagggcagcagcatcggcaagatgttcgaggccaccgccaggggcgccaggagga tggccatcctgggcgacaccgcctgggacttcggcagcgtgggcggcgtgctgaacag cctgggcaagatggtgcaccagatcttcggcagcgcctacacc 214. DEN-4 atgaccgtgttcttcatcctgatgatgctggtggccccctcctacggcatgaggtgcg PRI5/tE tgggcgtgggcaacagggacttcgtggagggcgtgagcggcggcgcctgggtggacct CO parent ggtgctggagcacggcggctgcgtgaccaccatggcccagggcaagcccaccctggac ttcgagctgatcaagaccaccgccaaggaggtggccctgctgaggacctactgcatcg aggccagcatcagcaacatcaccaccgccaccaggtgccccacccagggcgagcccta cctgaaggaggagcaggaccagcagtacatctgcaggagggacgtggtggacaggggc tggggcaacggctgcggcctgttcggcaagggcggcgtggtgacctgcgccaagttca gctgcagcggcaagatcaccggcaacctggtgcagatcgagaacctggagtacaccgt ggtggtgaccgtgcacaacggcgacacccacgccgtgggcaacgacaccagcaaccac ggcgtgaccgccaccatcacccccaggagccccagcgtggaggtgaagctgcccgact acggcgagctgaccctggactgcgagcccaggagcggcatcgacttcaacgagatgat cctgatgaagatgaagaagaagacctggctggtgcacaagcagtggttcctggacctg cccctgccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaaggaga ggatggtgaccttcaaggtgccccacgccaagaggcaggacgtgaccgtgctgggcag ccaggagggcgccatgcacagcgccctggccggcgccaccgaggtggacagcggcgac ggcaaccacatgttcgccggccacctgaagtgcaaggtgaggatggagaagctgcgca tcaagggcatgagctacaccatgtgcagcggcaagttcagcatcgacaaggagatggc cgagacccagcacggcaccaccgtggtgaaggtgaagtacgagggcaccggcgccccc tgcaaggtgcccatcgagatcagggacgtgaacaaggagaaggtggtgggcaggatca tcagcagcacccccctggccgagaacaccaacagcgtgaccaacatcgagctggagcc ccccttcggcgacagctacatcgtgatcggcgtgggcgacagcgccctgaccctgcac tggttcaggaagggcagcagcatcggcaagatgttcgagagcacctacaggggcgcca agaggatggccatcctgggcgagaccgcctgggacttcggcagcgtgggcggcctgct gaccagcctgggcaaggccgtgcaccaggtgttcggcagcgtgtacaccaccatgttc ggc 215. DEN1-\ atgagatctgtgaccatgattctcatgctgctgcctactgctctggccttccatctga C15/full caacaagaggtggcgagcctaccctgatcgtgtccaagcaagaacgcggcaagagcct prM/full gctgttcaagacttctgcaggggtgaatatgtgcactctcatcgccatggacctgggc E CO gagctgtgtgaggataccatgacctacaagtgtccacggatgaccgaggccgaacccg parent acgatgtggattgctggtgcaatgcaactgatacttgggtgacctatgggacctgtag (wherein ccagacaggcgagcataggagggataagaggtccgtcgccctggaccctcacgttggt a codon ctgggcctggaaaccagaaccgagacatggatgagctctgagggagcttggaagcaga encoding tccagaaggttgagacttgggctctgcgccatccaggctttacagtcattgccctgtt a Met cctggctcatgcaattgggacctctattactcagaagggcatcatcttcattctgctc amino atgctcgtgacaccatctatggccatgaggtgcgtgggcatcggcaaccgcgacttcg acid tggagggcctgagcggcgccacctgggtggacgtggtgctggagcacggcagctgcgt resdue gaccaccatggccaagaacaagcccaccctggacatcgagctgctgaagaccgaggtg is also accaaccccgccgtgctgaggaagctgtgcatcgaggccaagatcagcaacaccacca included ccgacagcaggtgccccacccagggcgaggccaccctggtggaggagcaggacaccaa as the cttcgtgtgcaggaggaccttcgtggacaggggctggggcaacggctgcggcctgttc first ggcaagggcagcctgatcacctgcgccaagttcaagtgcgtgaccaagctggagggca codon agatcgtgcagtacgagaacctgaagtacagcgtgatcgtgaccgtgcacaccggcga contiguous ccagcaccaggtgggcaacgagaccaccgagcacggcaccatcgccaccatcaccccc and caggcccccaccagcgagatccagctgaccgactacggcgccctgaccctggactgca prior to gccccaggaccggcctggacttcaacagggtggtgctgctgaccatgaagaagaagag the first ctggctggtgcacaagcagtggttcctggacctgcccctgccctggaccagcggcgcc codon of agcaccagccaggagacctggaacaggcaggacctgctggtgaccttcaagaccgccc the acgccaagaagcaggaggtggtggtgctgggcagccaggagggcgccatgcacaccgc nuclectide cctgaccggcgccaccgagatccagaccagcggcaccaccaccatcttcgccggccac sequence ctgaagtgcaggctgaagatggacaagctgaccctgaagggcgtgagctacgtgatgt encodng gcaccggcagcttcaagctggagaaggaggtggccgagacccagcacggcaccgtgct C15) ggtgcaggtgaagtacgagggcaccgacgccccctgcaagatccccttcagcagccag gacgagaagggcgtgacccagaacggcaggctgatcaccgccaaccccatcgtgatcg acaaggagaagcccgtgaacatcgaggccgagccccccttcggcgagagctacatcgt ggtgggcgccggcgagaaggccctgaagctgagctggttcaagaagggcagcagcatc ggcaagatgttcgaggccaccgccaggggcgccaggaggatggccatcctcggcgata cagcctgggattttggctcaatcggaggggtgttcaccagcgtgggtaaactgattca tcagattttcggtacagcatatggcgtgctgttttccggcgtctcttggaccatgaag atcggtattggcatcctcctcacatggctgggactgaatagcaggagcacaagcctga gcatgacctgcatcgctgttggcatggtcacactctacctgggtgtgatggttcaggc a 216. DEN-2\ atgaacaggaggaggagaactgtgggcgtgattatcatgctgatccctactgctatgg C15/full ccttccatctgacaacaagaaatggcgagcctcacatgatcgtgggcaggcaagagaa prM/full gggcaagagcctgctgttcaagactgaggacggcgtgaatatgtgcactctcatggcc E CO atcgacctgggcgagctgtgtgaggataccatcacctacaagtgtccactgctgaggc parent agaacgaacccgaggatatcgattgctggtgcaattcaacttctacttgggtgaccta (a codon tgggacctgtaccaccacaggcgagcataggagggagaagaggtccgtcgccctggtt encodng cctcacgttggtatgggcctggaaaccagaaccgagacatggatgagctctgagggag a Met cttggaagcacgcccagaggatcgagacctggattctgcgccatccaggctttatcat residue catggctgccatcctggcttatacaattgggaccactcacttccagagagccctcatc is also ttcattctgctcaccgccgtggcaccatctatgaccatgaggtgcatcggcatcagca included acagggacttcgtggagggcgtgagcggcggcagctgggtggacatcgtgctggagca as the cggctcctgcgtgaccaccatggccaagaacaagcccaccctggacttcgagctgatc first aagaccgaggccacccagcccgccaccctgaggaagtactgcatcgaggccaagctga codon ccaataccaccaccgagagcaggtgccccacccagggcgagcccagcctgaacgagga contiguous gcaggacaagaggttcgtgtgcaagcacagcatggtggacaggggctggggcaacggc and tgcggcctgttcggcaagggcggcatcgtgacctgcgccatgttcacctgcaagaaga prior to acatggagggcaacatcgtgcagcccgagaacctggagtacaccatcgtgatcacccc the first ccacagcggcgaggagcacgccgtgggcaacgacactggcaagcacggcaaggagatc codon of aagatcaccccccagagcagcatcaccgaggccgagctgaccggctacggcaccgtga the ccatggagtgcagccccaggaccggcctggacttcaacgagatcgtgctgctgcagat nucleotide ggaggacaaggcctggctggtgcacaggcagtggttcctggacctgcccctgccctgg sequence ctgcccggcgccgacacccagggcagcaaccgtatccagaaggagaccctggtgacct encoding tcaagaacccccacgccaagaagcaggacgtggtggtgctgggcagccaggagggcgc c15) catgcacaccgccctgaccggcgccaccgagatccagatgagcagcggcaacctgctg ttcaccggccacctgaagtgcaggctgaggatggacaagctgcagctgaagggcatga gctacagcatgtgcaccggcaagttccagatcgtgaaggagatcgccgagacccagca cggcaccatcgtgatcagggtgcagtacgagggcgacggcagcccctgcaagatcccc ctggagatcatggacctggagaagaggcacgtgctgggcaggctgatcaccgtgaacc ccatcgtgaccgagaaggacagccccgtgaacatcgaggccgagccccccttcggcga cagctacatcatcatcggcgtggagcccggccagctgaagctgcactggttcaagaag ggcagcagcatcggccagatgttcgagaccaccatgaggggcgccaagaggatggcca tcctgggcgacaccgcctgggacttcggcagcctgggcggcgtgttcaccagcatcgg caaggccctgcaccaggttttcggtgcaatctatggcgcagccttttccggcgtctct tggaccatgaagatcctgatcggggtcatcatcacatggattggaatgaatagcagga gcacaagcctgagcgttagcctcgtcctggttggcgtcatcacactctacctgggtgc catggttcaggca 217. DEN-3\ atgaaaactagcctctgcctgatgatgatgctgcctgctactctggccttccatctga C15/full catcaagagatggcgagcctaggatgatcgtgggcaagaacgaaagagggaagagcct prM/full gctgttcaagactgcttcagggatcaatatgtgcactctcatcgccatggacctgggc E CO gagatgtgtgacgataccgtgacctacaagtgtccacacatcaccgaggtcgaacccg parent aggatatcgattgctggtgcaatctgacttctacttgggtgacctatgggacctgtaa (a codon ccaggctggcgagcataggagggataagaggtccgtcgccctggctcctcacgttggt encoding atgggcctggacaccagaacccagacatggatgagcgctgagggagcttggaggcagg a Met tcgagaaggttgaaacttgggctctgcgccatccaggctttacaatcctggccctgtt amino cctggctcattacattgggacctctctgactcagaaggtcgtcatcttcattctgctc acid atgctcgtgacaccatctatgaccatgaggtgcgtgggcgtgggcaacagggacttcg residue tggagggcctgagcggcgccacctgggtggacgtggtgctggagcacggcggctgcgt is also gaccaccatggccaagaacaagcccaccctggacatcgagctgcagaagaccgaggcc included acccagctggccaccctgaggaagctgtgcatcgagggcaagatcaccaacatcacca as the ccgacagcaggtgccccacccagggcgaggccatcctgcccgaggagcaggaccagaa first ctacgtgtgcaagcacacctacgtggacaggggctggggcaacggctgcggcctgttc codon ggcaagggcagcctggtgacctgcgccaagttccagtgcctggagcccatcgagggca contiguous aggtggtgcagcacgagaacctgaagtacaccgtgatcatcaccgtgcacaccggcga and ccagcaccaggtgggcaacgacacccagggcgtgaccgtggagatcaccccccaggcc prior to agcaccgtggaggccatcctgcccgagtacggcaccctgggcctggagtgcagcccca the first ggaccggcctggacttcaacgagatgatcctgctgaccatgaagaacaaggcctggat codon of ggtgcacaggcagtggttcttcgacctgcccctgccctggaccagcggcgccaccacc the gagacccccacctggaacaggaaggagctgctggtgaccttcaagaacgcccacgcca nucleotide agaagcaggaggtggtggtgctgggcagccaggagggcgccatgcacaccgccctgac sequence cggcaccaccgagatccagaacagcggcggcaccagcatcttcgccggccacctgaag encoding tgcaggctgaagatggacaagctggagctgaagggcatgagctacgccatgtgcctga C15) acaccttcgtgctgaagaaggaggtgagcgagacccagcacggcaccatcctgatcaa ggtggagtacaagggcgaggacgccccctgcaagatccccttcagcaccgaggacggc cagggcaaggcccacaacggcaggctgatcaccgccaaccccgtggtgaccaagaagg aggagcccgtgaacatcgaggccgagccccccttcggcgagagcaacatcgtgatcgg catcggcgacaaggccctgaagatcaactggtacaagaagggcagcagcatcggcaag atgttcgaggccaccgccaggggcgccaggaggatggccatcctgggcgacaccgcct gggacttcggcagcgtgggcggcgtgctgaacagcctgggcaagatggtgcaccagat cttcggttcagcatataccgcactgttttccggcgtctcttggatcatgaagatcggt atcggggtcctcctcacatggattggactgaatagcaagaatacaagcatgagcttta gctgcatcgctattggcatcatcacactctacctgggtgtggtggttcaggca 218. DEN-4\ atgagatctaccatcaccctgctctgcctgatccctactgttatggccttctctctgt C15/full caacaagagatggcgagcctctcatgatcgtggccaagcacgaaagagggaggcctct prM/full gctgttcaagactactgaagggatcaataagtgcactctcatcgccatggacctgggc E CO gagatgtgtgaggataccgtgacctacaagtgtccactgctggtcaacaccgaacccg parent aggatatcgattgctggtgcaatctgacttctacttgggtgatgtatgggacctgtac (a codon ccagtccggagagagaaggagggagaagaggtccgtcgccctgactcctcactctggt encoding atgggcctggaaaccagagccgagacatggatgagctctgagggagcttggaagcacg Met amino cccagagggttgaatcttggattctgcgcaatccacgctttgcactcctggctggctt acid catggcttatatgattgggcagactggtattcagaggaccgtcttcttcgttctgatg residue atgctcgtggcaccatcttacggcatgaggtgcgtgggcgtgggcaacagggacttcg is also tggagggcgtgagcggcggcgcctgggtggacctggtgctggagcacggcggctgcgt included gaccaccatggcccagggcaagcccaccctggacttcgagctgatcaagaccaccgcc as the aaggaggtggccctgctgaggacctactgcatcgaggccagcatcagcaacatcacca first ccgccaccaggtgccccacccagggcgagccctacctgaaggaggagcaggaccagca codon gtacatctgcaggagggacgtggtggacaggggctggggcaacggctgcggcctgttc contiguous ggcaagggcggcgtggtgacctgcgccaagttcagctgcagcggcaagatcaccggca and acctggtgcagatcgagaacctggagtacaccgtggtggtgaccgtgcacaacggcga prior to cacccacgccgtgggcaacgacaccagcaaccacggcgtgaccgccaccatcaccccc the first aggagccccagcgtggaggtgaagctgcccgactacggcgagctgaccctggactgcg codon of agcccaggagcggcatcgacttcaacgagatgatcctgatgaagatgaagaagaagac the ctggctggtgcacaagcagtggttcctggacctgcccctgccctggaccgccggcgcc nucleotide gacaccagcgaggtgcactggaaccacaaggagaggatggtgaccttcaaggtgcccc sequence acgccaagaggcaggacgtgaccgtgctgggcagccaggagggcgccatgcacagcgc encoding cctggccggcgccaccgaggtggacagcggcgacggcaaccacatgttcgccggccac C15) ctgaagtgcaaggtgaggatggagaagctgcgcatcaagggcatgagctacaccatgt gcagcggcaagttcagcatcgacaaggagatggccgagacccagcacggcaccaccgt ggtgaaggtgaagtacgagggcaccggcgccccctgcaaggtgcccatcgagatcagg gacgtgaacaaggagaaggtggtgggcaggatcatcagcagcacccccctggccgaga acaccaacagcgtgaccaacatcgagctggagccccccttcggcgacagctacatcgt gatcggcgtgggcgacagcgccctgaccctgcactggttcaggaagggcagcagcatc ggcaagatgttcgagagcacctacaggggcgccaagaggatggccatcctgggcgaga ccgcctgggacttcggcagcgtgggcggcctgctgaccagcctgggcaaggccgtgca ccaggtgttcggcagcgtgtacaccacaatgtttggcggcgtctcttggatggtgaga atcctgatcgggttcctcgtcctgtggattggaaccaatagcaggaatacaagcatgg ccatgagctgcatcgctgttggcggcatcacactcttcctgggtttcaccgttcacgc a 219. DEN-1 atgagatctgtgaccatgattctcatgctgctgcctactgctctggccttccatctga with rest caacaagaggtggcgagcctaccctgatcgtgtccaagcaagaacgcggcaagagcct of CO gctgttcaagacttctgcaggggtgaatatgtgcactctcatcgccatggacctgggc PRM- gagctgtgtgaggataccatgacctacaagtgtccacggatgaccgaggccgaacccg encoding acgatgtggattgctggtgcaatgcaactgatacttgggtgacctatgggacctgtag sequence ccagacaggcgagcataggagggataagaggtccgtcgccctggaccctcacgttggt ctgggcctggaaaccagaaccgagacatggatgagctctgagggagcttggaagcaga tccagaaggttgagacttgggctctgcgccatccaggctttacagtcattgccctgtt cctggctcatgcaattgggacctctattactcag 220. DEN-2 atgaacaggaggaggagaactgtgggcgtgattatcatgctgatccctactgctatgg with rest ccttccatctgacaacaagaaatggcgagcctcacatgatcgtgggcaggcaagagaa of CO gggcaagagcctgctgttcaagactgaggacggcgtgaatatgtgcactctcatggcc C15/PRM- atcgacctgggcgagctgtgtgaggataccatcacctacaagtgtccactgctgaggc encoding agaacgaacccgaggatatcgattgctggtgcaattcaacttctacttgggtgaccta sequence tgggacctgtaccaccacaggcgagcataggagggagaagaggtccgtcgccctggtt cctcacgttggtatgggcctggaaaccagaaccgagacatggatgagctctgagggag cttggaagcacgcccagaggatcgagacctggattctgcgccatccaggctttatcat catggctgccatcctggcttatacaattgggaccactcacttccag 221. DEN-3 atgaaaactagcctctgcctgatgatgatgctgcctgctactctggccttccatctga with rest catcaagagatggcgagcctaggatgatcgtgggcaagaacgaaagagggaagagcct of CO gctgttcaagactgcttcagggatcaatatgtgcactctcatcgccatggacctgggc c15/PRM- gagatgtgtgacgataccgtgacctacaagtgtccacacatcaccgaggtcgaacccg encoding aggatatcgattgctggtgcaatctgacttctacttgggtgacctatgggacctgtaa sequence ccaggctggcgagcataggagggataagaggtccgtcgccctggctcctcacgttggt atgggcctggacaccagaacccagacatggatgagcgctgagggagcttggaggcagg tcgagaaggttgaaacttgggctctgcgccatccaggctttacaatcctggccctgtt 222. cctggctcattacattgggacctctctgactcag DEN-4 atgagatctaccatcaccctgctctgcctgatccctactgttatggccttctctctgt with rest caacaagagatggcgagcctctcatgatcgtggccaagcacgaaagagggaggcctct of CO gctgttcaagactactgaagggatcaataagtgcactctcatcgccatggacctgggc C15/PRM- gagatgtgtgaggataccgtgacctacaagtgtccactgctggtcaacaccgaacccg encoding aggatatcgattgctggtgcaatctgacttctacttgggtgatgtatgggacctgtac sequence ccagtccggagagagaaggagggagaagaggtccgtcgccctgactcctcactctggt atgggcctggaaaccagagccgagacatggatgagctctgagggagcttggaagcacg cccagagggttgaatcttggattctgcgcaatccacgctttgcactcctggctggctt catggcttatatgattgggcagactggtattcag 223. DEN-1 ctgattcatcagattttcggtacagcatatggcgtgctgttttccggcgtctcttgga with rest ccatgaagatcggtattggcatcctcctcacatggctgggactgaatagcaggagcac of CO aagcctgagcatgacctgcatcgctgttggcatggtcacactctacctgggtgtgatg EnV- gttcaggca encoding sequence 224. DEN-2 gccttttccggcgtctcttggaccatgaagatcctgatcggggtcatcatcacatgga with rest ttggaatgaatagcaggagcacaagcctgagcgttagcctcgtcctggttggcgtcat of CO cacactctacctgggtgccatggttcaggca EnV- encoding sequence 225. DEN-3 gcactgttttccggcgtctcttggatcatgaagatcggtatcggggtcctcctcacat with rest ggattggactgaatagcaagaatacaagcatgagctttagctgcatcgctattggcat of CO catcacactctacctgggtgtggtggttcaggca EnV- encoding sequence 226. DEN-4 ggcgtctcttggatggtgagaatcctgatcgggttcctcgtcctgtggattggaacca with rest atagcaggaatacaagcatggccatgagctgcatcgctgttggcggcatcacactctt of CO cctgggtttcaccgttcacgca EnV- encoding seq 227. DEN-1\ MRSVTMTLMLLPTALAFHLTTRGGEPTLIVSKQERGKSLLFKTSAGVNMcTLIAMDLG C15/full ELCEDTMTYKCPRMTEAEPDDVDcWcNATDTWVTYGTCSQTGEHRRDKRSVALDPHVG prM/full LGLETRTETWMSSEGAWKQIQKVETWALRHPGFTVIALFLAHATGTSTTQKGIIFILL E parent MLVTPSMAMRcVGIGNRDFVEGLSGATWVDVVLEHGScVTTMAKNKPTLDIELLKTEV TNPAVLRKLCIEAKISNTTTDSRCPTQCEATLVEEQDTNFVCRRTFVDRGWGNGCGLF GKGSLITcAKFKCVTKLEGKTVQYENLKYSVIVTVHTGDQHQVCNETTEHGTIATITP QAPTSEIQLTDYGALTLDCSPRTGLDFTNRWLLTMKKKSWLVHKQWFLDLPLPWTSGA STSQETWNRQDLLVTFKTAHAKKQEVVVLGSQEGAMHTALTGATEIQTSGTTTIFAGH LKCRLKMDKLTLKGVSYVMcTGSFKLEKEVAETQHGTVLVQVKYEGTDAPcKIPFSSQ DEKGVTQNGRLITANPIVTDKEKPVNIEAEPPFGESYTVVGAGEKALKLSWFKKGSSI CKMFEATARGARRMAILGDTAWDFGSIGGVPTSVGKLIHQIFGTAYGVLFSGVSWTMK IGIGILLTWLGLNSRSTSLSMTCIAVGMVTLYLGVMVQA 228. DEN-2\ MNRRRRTVGVIIMLIPTAMAFHLTTRNGEPHMIVGRQEKGKSLLFKTEDGVNMCTLMA C15/full IDLGELCEDTITYKCPLLRQNEPEDIDCWCNSTSTWVTYGTCTTTGEHRREKRSVALV prM/full PHVGMGLETRTETWMSSEGAWKHAQRTETWILRHPGFIIMAAILAYTIGTTHFQRALI E parent FILLTAVAPSMTMMcIGISNRDFVEGVSGGSWVDIVLEHGScVTTMAKNKPTLDFELI KTEATQPATLRKYCIEAKLTNTTTESRCPTQGEPSLNEEQDKRFVCKISMVDRGWGNG CGLFGKGGIVTCAMFTcKKNMEGNIVQPENLEYTIVITPRSGEEHAVGNIJTGKGKEI KITPQSSITEAELTGYGTVTMEcSPRTGLDFNEITLLQMEDKAWLVHRQWFLDLPLPW LPGADTQGSNRIQKETLVTFKNPHAKKDVVVLGSQEGAMIITALTGATEIQMSSGNLL FTGHLKCRLRMDKLQLKGMSYSMCTGKFQIVKEIAETQHGTIVIRVQYEGDGSPCKIP LETMDLEKRIVLGRLITVNPIVTEKDSPVNIEAEPPFGDSYIIIGVEPGQLKLHWFKK GSSIGQMFETTMRGAKRMAILGDTAWDFGSLGGVFTSIGKALHQVFGAIYGAAFSGVS WTMKILIGVTITWIGMNSRSTSLSVSLVLVGVITLYLGAMVQA 229. DEN-3\ MTSLCLMMRMLPATLAFHLTSRDGEPRMIVGKNERGKSLLPKTASGINMCTLIAMDLG C15/full EMCDDTVTYKCPHITEVEPEDIDCWCNLTSTWVTYGTcNQAGEHRRDKRSVALAPHVG prM/full MGLDTRTQTWMSAEGAWRQVEKVETWALRHPGFTILALFLAHYIGTSLTQKVVIFILL E parent MLVTPSMTMRCVGVGNRDFVEGLSGATWVDVVLEHGGCVTTMAKNKPTLDIELQKTEA TQLATLRKLCIEGKITNITTDSRCPTQGEAILPEEQDQNYVCKHTYVDRGWGNGCGLF GKGSLVTCAKFQCLEPIEGKVVQHENLKYTVIITVRTGDQHQVGNDTQGVTVEITPQA STVEAILPEYGTLGLESPRTGLDFNEMILLTMIKNKAWMVHRQWFFDLPLPWTSGATT ETPTWNRKELLVTFKNAHAKKQEVVVLGSQEGAIHTALTGTTEIQNSGGTSIFAGHLK CRLKMDKLELKGMSYAMCLNTFVLKKEVSETQHGTILIKVEYKGEDAPCKIPFSTEDC QGKAHNGRLITANPVVTKKEEPVNIEAEPPFGESNIVIGIGDKALKINWYKKGSSIGK MFEATARGARRMAILGDTAWDFGSVGGVLNSLGKMVHQIFGSAYTALFSGVSWIMKIG IGVLLTWIGLNSKNTSMSFSCIAIGIITLYLGVVVQA 230. DEN-4\ MRSTITLLCLIPTVMAFSLSTRDGEPLMIVAKHERGRPLLFKTTEGINKcTLIAMDLG C15/full EMCEDTVTYKCPLLVNTEPEDIDCWCNLTSTWVMYGTCTQSGERRREKRSVALTPHSG prM/full MGLETPAETWMSSEGAWKHAQRVESWILRNPRFALLAGFMAYMTGQTGIQRTVFFVLM E parent MLVAPSYGMRCVGVGNRDFVEGVSGGAWVDLVLEHGGCVTTMAQGKPTLDFELIKTTA KEVALLRTYCIEASISNITTATRCPTQGEPYLKEEQDQQYICRRDVVDRGWGNGCGLF GKGGVVTCAKFSCSGKITGNLVQIENLEYTVVVTVHNDTHAVGNDTSNIIGVTATTTP RSPSVEVKLPDYGELTLDCEPRSGIDFNEMILMKMKKKTWLVHKQWFLDLPLPWTAGA DTSEVHWNHKERMVTFKVPHAKRQDVTVLGSQEGAMHSALAGATEVDSGDGNHMFAGH LKCKVPMEKLRITKGMSYTMCSGKFSIDKEMAETQHGTTVVKVKYEGTGAPCKVPIEM DVNKEKVVGRIISSTPLAENTNSVTNIELEPPFGDSYTVIGVGDSALTLHWFRKGSSI GKNFESTYRGAKRMATLGETAWDFGSVGGLLTSLGKAVHQVFGSVYTTDFGGVsWNVR ILIGFLVLWTGTNSRNTSMAMSCIAVGGITLFLGFTVRA 231. DEN-1 actgtcttctttgtcctaatgatgctggtcgccccatcctacggaatgcgatgcgtag PRM15/E gagtaggaaacagagactttgtggaaggagtctcaggtggagcatgggtcgacctggt trunc WT gctagaacatggaggatgcgtcacaaccatggcccagggaaaaccaaccttggatttt cDNA gaactgactaagacaacagccaaggaagtggctctgttaagaacctattgcattgaag sequence cctcaatatcaaacataactacggcaacaagatgtccaacgcaaggagagccttatct fragMent gaaagaggaacaggaccaacagtacatttgccggagagatgtggtagacagagggtgg Nucleic ggcaatggctgtggcttgtttggaaaaggaggagttgtgacatgtgcgaagttttcat acids gttcggggaagataacaggcaatttggtccaaattgagaaccttgaatacacagtggt 894-2285 tgtaacagtccacaatggagacacccatgcagtaggaaatgacacatccaatcatgga of gttacagccatgataactcccaggtcaccatcggtggaagtcaaattgccggactatg GenBank gagaactaacactcgattgtgaacccaggtctggaattgactttaatgagatgattct Acc. No. gatgaaaatgaaaaagaaaacatggctcgtgcataagcaatggtttttggatctgcct AB074761 cttccatggacagcaggagcagacacatcagaggttcactggaattacaaagagagaa tggtgacatttaaggttcctcatgccaagagacaggatgtgacagtgctgggatctca ggaaggagccatgcattctgccctcgctggagccacagaagtggactccggtgatgga aatcacatgtttgcaggacatcttaagtgcaaagtccgtatggagaaattgagaatca ggaatgtcatacacgatgtgttcaggaaagtt ttcaattgacaaagagatggcaga aacacagcatgggacaacagtggtgaaagtcaagtatgaaggtgctggagctccgtgt aaagtccccatagagataagagatgtaaacaaggaaaaagtggttgggcgtatcatct catccacccctttggctgagaataccaacagtgtaaccaacatagaattagaaccccc ctttggggacagctacatagtgataggtgttggaaacagcgcattaacactccattgg ttcaggaaagggagttccattggcaagatgtttgagtccacatacagaggtgcaaaac gaatggccattctaggtgaaacagcttgggattttggttccgttggtggactgttcac atcattgggaaaggctgtgcaccaggtttttggaagtgtgtatacaaccatgtttgga 232. DEN-2 gtcctgatattcatcctactgacagccatcgctccttcaatgacaatgcgctgcatag PRM15/E gaatatcaaatagggactttgtggaaggagtgtcaggagggagttgggttgacatagt trunc WT tttagaacatggaagttgtgtgacgacgatggcaaaaaataaaccaacactggacttt cDNA gaactgataaaaacagaagccaaacaacccgccaccttaaggaagtactgtatagagg sequence ctaaactgaccaacacgacaacagactcgcgctgcccaacacaaggggaacccaccct fragment gaatgaagagcaggacaaaaggtttgtctgcaaacattccatggtagacagaggatgg Nucleic ggaaatggatgtggattatttggaaaaggaggcatcgtgacctgtgccatgttcacat acids gcaaaaagaacatggagggaaaaattgtgcagccagaaaacctggaatacactgtcgt 892-2274 tataacacctcattcaggggaagaacatgcagtcggaaatgacacaggaaaacatggt of aaagaagtcaagataacaccacagagctccatcacagaggcggaactgacaggctatg GenBank gcactgttacgatggagtgctctccaagaacgggcctcgacttcaatgagatggtgtt Acc. No. gctgcaaatgaaagacaaagcttggctggtgcacagacaatggttcctagacctaccg NC_001474 ttgccatggctgcccggagcagacacacaaggatcaaattggatacagaaagagacac tggtcaccttcaaaaatccccatgcgaaaaaacaggatgttgttgtcttaggatccca agagggggccatgcatacagcactcacaggggctacggaaatccagatgtcatcagga aacctgctgttcacaggacatcttaagtgcaggctgagaatggacaaattacaactta aagggatgtcatactccatgtgcacaggaaagtttaaagttgtgaaggaaatagcaga aacacaacatggaacaatagtcattagagtacaatatgaaggagacggctctccatgc aagaccccttttgagataatggatctggaaaaaagacatgttttgggccgcctgacca cagtcaacccaattgtaacagaaaaggacagtccagtcaacatagaagcagaacctcc attcggagacagctacatcatcataggagtggaaccaggacaattgaagctggactgg ttcaagaaaggaagttccatcggccaaatgtttgagacaacaatgaggggagcgaaaa gaatggccattttgggcgacacagcctgggattttggatctctgggaggagtgttcac atcaataggaaaggctctccaccaggtttttggagcaatctacggggct 233. DEN-3 gttatttttatactattaatgctggttaccccatccatgacaatgagatgtgtgggag PRM15/E taggaaacagagattttgtggaaggcctatcgggagctacgtgggttgacgtggtgct trunc WT cgagcacggtgggtgtgtgactaccatggctaagaacaagcccacgctggacatagag cDNA cttcagaagactgaggccactcagctggcgaccctaaggaagctatgcattgagggaa sequence aaattaccaacataacaaccgactcaagatgtcccacccaaggggaagcgattttacc fragment tgaggagcaggaccagaactacgtgtgtaagcatacatacgtggacagaggctgggga Nucleic aacggttgtggtttgtttggcaagggaagcttggtgacatgcgcgaaatttcaatgtt acids tagaatcaatagagggaaaagtggtgcaacatgagaacctcaaatacaccgtcatcat (nt) 893- cacagtgcacacaggagaccaacaccaggtgggaaatgaaacgcagggagttacggct 2263 of gagataacatcccaggcatcaaccgctgaagccattttacctgaatatggaaccctcg GenBank ggctagaatgctcaccacggacaggtttggatttcaatgaaatgattttattgacaat Acc. No. gaagaacaaagcatggatggtacatagacaatggttctttgacttacccctaccatgg M25277 acatcaggagctacaacaaaaacaccaacttggaacaggaaagagcttcttgtgacat ttaaaaatgcacatgcaaaaaagcaagaagtagttgtccttggatcacaagagggagc aatgcatacagcactgacaggagctacagagatccaaacctcaggaggcacaagtatt tttgcggggcacttaaaatgtagactcaagatggacaaattgaaactcaaggggatga gctatgcaatgtgcttgaatacctttgtgttgaagaaagaagtctccgaaacgcagca tgggacaatactcattaaggttgagtacaaaggggaacatgcaccctgcaagattcct ttctccacggaggatggacaagggaaagctcacaatggcagactgatcacagccaatc cagtggtgaccaagaaggaggagcctgtcaacattgaggctgaacctccttttgggga aagtaatatagtaattggaagtggagacaaagccctgaaaatcaactggtacaggaag ggaagctcgattgggaagatgttcgaggccactgccagaggtgcaaggcgcatggcca tcttgggagacacagcctgggactttggatcagtgggtggtgttttgaattcattagg gaaaatggtccaccaaatatttgggagtgcttacaca 234. DEN-4 actgtcttctttgtcctaatgatgctggtcgccccatcctacggaatgcgatgcgtag PRX15E gagtaggaaacagagactttgtggaaggagtctcaggtggagcatgggtcgacctggt trunc WT gctagaacatggaggatgcgtcacaaccatggcccagggaaaaccaaccttggatttt cDNA gaactgactaagacaacagccaaggaagtggctctgttaagaacctattgcattgaag sequence cctcaatatcaaacataactacggcaacaagatgtccaacgcaaggagagccttatct fragment gaaagaggaacaggaccaacagtacatttgccggagagatgtggtagacagagggtgg Nucleic ggcaatggctgtggcttgtttggaaaaggaggagttgtgacatgtgcgaagttttcat acids gttcggggaagataacaggcaatttggtccaaattgagaaccttgaatacacagtggt (nt) 893- tgtaacagtccacaatggagacacccatgcagtaggaaatgacacatccaatcatgga of gttacagccatgataactcccaggtcaccatcggtggaagtcaaattgccggactatg GenBank gagaactaacactcgattgtgaacccaggtctggaattgactttaatgagatgattct Acc. No. gatgaaaatgaaaaagaaaacatggctcgtgcataagcaatggtttttggatctgcct M25277 cttccatggacagcaggagcagacacatcagaggttcactggaattacaaagagagaa tggtgacatttaaggttcctcatgccaagagacaggatgtgacagtgctgggatctca ggaaggagccatgcattctgccctcgctggagccacagaagtggactccggtgatgga aatcacatgtttgcaggacatcttaagtgcaaagtccgtatggagaaattgagaatca agggaatgtcatacacgatgtgttcaggaaagttttcaattgacaaagagatggcaga aacacagcatgggacaacagtggtgaaagtcaagtatgaaggtgctggagctccgtgt aaagtccccatagagataagagatgtaaacaaggaaaaagtggttgggcgtatcatct catccacccctttggctgagaataccaacagtgtaaccaacatagaattagaaccccc ctttggggacagctacatagtgataggtgttggaaacagcgcattaacactccattgg ttcaggaaagggagttccattggcaagatgtttgagtccacatacagaggtgcaaaac gaatggccattctaggtgaaacagcttgggattttggttccgttggtggactgttcac atcattgggaaaggctgtgcaccaggtttttggaagtgtgtatacaaccatgtttgga 235. 18H6 atggtggtgatcttcatcctgctgatgctggtgaccccctccatgacaatgaggtgcg check tgggcgtgggcaacagggacttcgtggagggcctgagcggcgccacctgggtggacgt this ggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcccaccctggac PRM15/tE ttcgagctgatcaagaccaccgccaaggaggtggccctgctgaggacctactgcatcg Round II aggccagcatcagcaacatcaccaccgccaccaggtgccccacccagggcgaggccat shuffled cctgcccgaggagcaggaccagaactacgtgtgcaagcacacctacgtggacaggggc clone tggggcaacggctgcggcctgttcggcaagggcagcctggtgacctgcgccaagttca cctgcaagaagaacatggagggcaacatcgtgcagcccgagaacctggagtacaccat cgtgatcaccccccacaccggcgaccagcaccaggtgggcaacgacacccagggcgtg accgtggagatcaccccccaggccagcaccgtggaggccatcctgcccgagtacggca ccctgggcctggagtgcagccccaggactggcctggacttcaacagggtggtgctgct gaccatgaagaagaagagctggctggtgcacaagcagtggttcctggacctgcccctg ccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaaggagaggatgg tgaccttcaagaacgcccacgccaagaggcaggacgtgaccgtgctgggcagccagga gggcgccatgcacagcgccctggccggcgccaccgagatccagatgagcagcggcaac ctgctgttcaccggccacctgaagtgcaggctgaagatggacaagctgcagctgaagg gcgtgagctacgtgatgtgcaccggcagcttcaagctggagaaggaggtggccgagac ccagcacggcaccgtgctggtgcaggtgaagtacgagggcaccgacgccccctgcaag atccccttcagcaccgaggacggccagggcaaggcccacaacggcaggctgatcaccg tgaaccccatcgtgatcgacaaggagaagcccgtgaacatcgagctggagcccccctt cggcgagagctacatcgtggtgggcgccggcgagaaggccctgaagctgagctggttc aagaagggcagcagcatcggcaagatgttcgaggccaccgccaggggcgccaagagga tggccatcctgggcgagaccgcctgggacttcggcagcgtgggcggcctgctgaccag cctgggcaaggccgtgcaccaggtgttcggcgccatctacggcgcc 236. 25B6 MRSTITLLCLIPTVMAFSLSTRDGEPLMIVAKHERGRPLLFKTTEGINKTLTANJJLG (C15/full ELCEDTITYKCPLLRQDEPEDIDCWCNATDTWVTYGTGNQAGEHRRDKRSVALTPHSG length MGLETRAETWMSSEGAWKQIQKVETWALRHPGFTVIALFLAHAIGTSITQKGIIFILL prM/full MLVTPSMAMRCVGVGNRDFVEGLSGATWVDVVLEHGGCVTTMAKNKPTLDIELQKTEA length E) TQLATLRKLCIEGKITNITTDSRCPTQGEAILPEEQDQQYTCRRDVVDRGWGNGCGLF GKGSLVTCAKFQCLEPIEGKVVQHENLKYTVIITVHTGDQHQVGNDTQGVTVEITPQA STVEAILPEYGTLGLECSPRTGLDFNEMILLTMKNKAWMVHKQWFLDLPLPWTSGAST SQETWNRQDLLVTFKTAHAKKQEVVVLGSQEGAMHTALTGATEIQMSSGNLLFTGHLK CRLKNDKLELKGMSYAMCLNTFVLKKEVAETQHGTVLVQVKYEGTDAPCKIPFSSQDE KGVTQNGRLITANPIVIDKEKPVNIEAEPPFGESYIVIGIGDKALKINWYKIGSSIGK MFEATARGARRIAILGDTAWDFGSIGGVFTSVGKLIHQIFGTAYGVLFSGVSWTMKIG IGTLLTWLGLIJSRSTSMSFSCIAIGIITLYLGVVVQA 237. 25B10 MRSVTMTLMLLPTTLAFHLTSRDGEPRIIVAKHERGRPLLFKTTEGTNKCTLIANDLG (C15/full EMCDDTVTYKCPHITEVEPEDIDCWCNSTSTWVTYGTcNQAGEHPRDKRSVALAPHVG length MGLDTRTQTWMSAEGAWKHAQRIETWILRHPGFIIMAAILAYTIGTTHFQRALIFTLL prM/full TAAAPSMTMRcTGISNPFLFVEGVSGGSWVMDIVLEHGScVTTMAKPTLDIELLKTEV length E) TNPAVLRKLCIEAKISNTTTDSRCPTQGEATLPEEQDQNYVCKHTYVDRGWGNGCGLF GKGSLVTCAKFQCLEPTEGKVVQHENLKYTVIITVHTGDQHQVCNDTQGVTVEITPQA STVEAILPEYGTLGLEcSPRTGLDFNEMILLTMKNKAWMVHRQWFFDLPLPWTSGATT ETPTWNRKELLVTFKTAHAKKQEVVVLGSQEGAMHSALAGATEVQMSSGNLLFTGHLK CRLRMDKLQLKGMSYSMCTGKFQIVKEIAETQHGTIVIRVEYKGEDAPCKIPFSTEDG QGKAHNGRLITANPVVTKKEEPVNIEAEPPFGKSNIVIGIGDKALKINWYKKGSSIGK MFEATARGARRMATLGDTAWIFGSIGGVFTSIGKALHQVFGSVYTTMFGGVSWMVRIL IGFLVLWIGTNSRSTSLSMTCIAVGMVTLYLGVMVQA 238. 25D4 MRSVTMILMLLPTALAFHLTTRGGEPTLIVSKQERGKSLLFKTSAGINMCTLIAMDLG (C15/full/ EMCDDTVTYKCPHITEVEPEDIDCWCNLTSTWVTYGTcNQAcEHRRDKRSVALVPHVG length MGLETRTETWMSSEGAWKHAQRIETWTLRHPGFTILALFLAHYIGTSLTQKVVIFILL prM/full MLVTPSMTMRcIGISNRDFVEGLSGATWVDVVLEHGScVTTMAKNKPTLDFELIKTEA length E) TQPATLRKYCIEAKLTNITTDSRCPTQGEATLPEEQDQNYVCKHTYVDRGWGNGCGLF GKGSLVTCAKFQCLEPIEGKVVQHENLKYTVVVTVHKGDTHAVGNDTSNHGVTATITP RSPSVEVKLPDYGELTLDCEPRSGIDFNEMILMKMKKKTWLVHKQWFLDLPLPWTAGA DTSEVHWNHKERMVTFKVPHAKRQDVTVLGSQEGAMHTALTGATEIQMSSGNHMFAGH LKCRLRMDKLQLKGMSYSMCTGKFQIVKEIAETQHGTIVIRVQYEGDGSPCKIPLEIM DLEKRHVLGRLITANPVVTKKDSPVNIEAEPPFGDSYITIcVEPGQLKLHWPKKGSST GQMFESTYRGAKPMAILGETAWDFGSVGGLLTSLGKAVHQVFGSVYTTMFGGVSWMVR ILTGFLVLWIGTNSRNTSMAMScTAVGGITLFLGFTVQA 239. 25E112 MKTSLCLMMMLPATLAFHLTSRDGEPRMIVGKNERGKSLLFKTASGINMCTLIAMDLG (C15/full EMCDDTVTYKCPHITEVEPEDIDCWCNLTSTWVTYGTCTQSGERRREKRSVALTPHSG length MGLETRTETWMSSECAWKQIQKVETWALRHPGFTVTALFLAHAIGTSTTQKGIIFTLL prM/full MLVTPSMAMRCVGVGNRDFVEGLSGATWVDIVLEHGSCVTTMAKNKPTLDFELIKTEA length E) TQPATLRKLCIEGKITNITTDSRCPTQGEAILPEEQDQNYVCKHTYVDRGWGNGCGLF GKGSLVTCAKFQCLEPIEGKVVQHENLKYTVIITVHTGDQHQVGNDTQGVTVEITPQA STVEAILPEYGALTLDCSPRTGLDFNRVVLLTIKKKSWLVHKQWFLDLPLPWTSGAST SQETWNRQDLLVTFKTAHAKKQEVVVLGSQEGAMHTALTGTTEIQNSGGTSTFAGHLK CRLKMDKLTLKGVSYVMCTGSFKLEKEVSETQHGTILIKVEYKGEDAPCKVPIEIRDV NKEKVVGRTISSTPLAENTNSVTNTELEPPFGDSYIVIGVGDSALTLHWFRKGSSIGK MFEATARGARRMAILGDTAWDFGSICGVFTSVGKLIHQIFGTAYGVLFSGVSWTMKTG IGILLTWLGLNSRSTSLSMTCIAVGMVTLYLGVMVQA 240. 25H4 MRSVTMILMLLPTALAFHLTTRGGEPLMIVAKHERGKSLLFKTASGINMCTLIAMDLG (C15/full EMCDDTVTYKCPRMTEVEPEDIDCWCNLTSTWVTYGTCTTTGEHRREKRSVALVPHVG length MGLETRTETWMSSEGAWKQIQKVETWALRHPGFTILALFLAHYIGTSLTQEVVIFILL prM full MLVTPSMAIARCVGIGNRDFVEGLSGATWVDVVLEHGSVTTMAKNKPTLDTELLKTEV length E) TNPAVLRKLCIEAKISNTTTDSRCPTQGEAILPEEQDQNYVCKHTYVDRGWGNGCGLF GKGSLVTCAKFQCLEPIEGKVVQHENLKYTVIVTVHTGDQHQVGNDTQGVTVEITPQA STVEAILPEYGTLGLEcSPRTGLDFNEMILLTMKNKAWMVIKQWPLDLPLPWTSGAST SQETWNRQDLLVTFKTAHAKKQDVVVLGSQEGAMHTALTGATEIQMSSGNLLFTGHLK CRLRMDKLQLKGMSYSMCTGKFQIVKEIAETQHGTIVIRVQYEGDGSPCKIPLEIMDL EKRHVLGRLTTVNPIVTEKDSPVNIEAEPPFGDSYIIIGVEPGQLKLHWFKKGSSIGQ MFETTMRGAKRMAILGDTAWDFGSLGGVFTSIGKALHQTFGAIYGAAFSGVSWTMKTL IGVIITWIGMNSRSTSLSVSLVLVGVITLYLGAMVQA 241. 27G6 MKTSLCLMMMLPTALAFHLTTRGGEPTLIVSKQERGKSLLFKTSAGVNMCTLIAMDLG (C15/full ELCEDTMTYKCPRMTEAEPDDVDCWCNATDTWVTYGTCSQTGEHRREKRSVALVPHVG length MGLETRTETWNSSEGAWKQIQKVETWALRHPGFTVIALFLAHAIGTSITQKGIIFILL prM/full MLVTPSMAMRCVGIGNRDFVEGVSGGSWVDIVLEHGScVTTMAKNKPTLDIELLKTEV length E) TNPAVLRKLCIEAKISNTTTDSRCPTQGEAILPEEQDQNYVCKHTYVDRGWGNGCGLF GKGSLVTCAKFQCLEPIEGKVVQHENLKYTVIITVHTGDQHQVGNDTQGVTVEITPQA STVEAILPDYGELTLDCEPRSGIDFNEMTLLTMKNKAWMVHRQWFFDLPLPWTSGATT ETPTWNRKELLVTFKNAHAKKQEVVVLGSQEGAMHTALTGTTEIQNSGGTSTFAGHLK CKVRMEKLRIKGMSYTMCSGKFSIDKEMAETQHGTTVVKVKYEGTGAPCKVPIEIRDV NKEKVVGRIISSTPLAENTNSVTNTELEPPFGDSYIVIGVGDSALTLHWFRKGSSIGK MFESTYRGAKRMATLGETAWDFGSVGGLLTSLGKAVHQVFGSVYTTMFGGVSWMVRIL IGFLVLWIGTNSRNTSMAMSCIAVGGITLFLGFTVHA 242. 27G6 MRSTITLLCLIPTVMAFSLSTRDGEPLMIVAKHERGRPLLFKTTEGINKCTLIAMDLG (C15/full EMCEDTVTYKCPLLVNTEPEDIDCWCNSTSTWVMYGTCSQTGEHRRDKRSVALVPHVG length MGLETRTETWMSSEGAWKQTQKVETWALRHPGFTVIALFLAHYIGTSLTQKVVIFILL prM/full MLVTPSMTMRCVGIGNRDFVEGLSGATWVDVVLEHGSVTTMAKIJKPTLDIELLKTEV length E) TNPAVLRKLCIEAKISNTTTDSRCPTQGEPYLKEEQDQQYICRRDVVDRGWGNGCGLF GKGSLITCAKFKCVTKLEGKIVQYENLEYTIVITPHSGEEHAVGNDTSNIGVTATITP RSPSVEVKLPGYGTVTMEcSPRTGLDFNRVVLLTMKKKSWLVHKQWFLDLPLPWTSGA TTETPTWNRKELLVTFKNAHAKRQDVTVLGSQEGAMHSALAGATEVDSGDGNHMFAGH LKCKVPMEKLTLKGVSYVMCTGSFKLEKEVAETQHGTVLVQVKYEGTDAPCKIPFSSQ DEKGVTQNGRLITANPVVTKKEEPVNIEAEPPFGESNIVTGVGDSALTLHWPRKGSSI GKMFESTYRGARRMAILGDTAWDFGSVGGLLTSLGKAVHQVFGSVYTTMFGGVSWMVR ILIGFLVLWIGTNSRNTSMAMSCIAVGGITLFLGFTVHA 243. 28A11 MRSTITLLCLIPTVMAFHLTTRNGEPRMIVGKNERGKSLLFKTEDGVNMCTLMAIDLG (c15/full ELCEDTITYKCPLLVNTEPEDIDCWCNLTSTWVTYGTcNQAGEHRRDKRSVALAPHVG length MGLETRTETWMSSEGAWKQIQKVETWALRHPGFTILALFLAHATGTSITQKGIIFILL prM/full MLVTPSMAMRCVGTGNRDFVEGLSGATWVDVVLEHGScVTTMAKNKPTLDIELLKTEV length E) TNPAVLRKLCIEAKITNITTDSRCPTQGEAILPEEQDQNYVCKHSMVDRGWGNGCGLF GKGSLVTCAKFQCLEPIEGKVVQHENLKYTVIITTHTGDQIQVGNDTQGVTVEITPQA STVEAILPEYGTLGLEcSPRTcLDFNEMILMKMKKKTWLVHRQWFLDLPLPWTSGAST SQETWNRQDLLVTFKNAHAKKQEVVVLGSQEGAMHTALTGATEIQTSGTTTIFAGHLK CKVRMEKLRIKGMSYTMCSGKFSIDKEMAETQHGTTVVKVKYEGTGAPCKVPIEIRDV NKEKVVGRIISSTPLAENTNSVTNIELEPPFGDSYIVIGVGDSALTLHWFRKGSSIGK MFESTYRGAKRNAILGETAWDFGSVGGLLTSLGKAVHQVPGSVYTTMFGGVSWMVRIL IGFLVLWIGTNSRNTSMAMSCIAIGIITLYLGVVVQA 244. 28C1 MKTSLCLVTHTHILPATMAFSLSTRDGPLMIVAKERGRPLFKTTEGINKCTLIAMDLG (C15/full) ELCEDTMTYKCPLLRQNEPEDIDCWCNSTSTWVTYGTCNQAGEHRRDKRSVALDPHVG length LGLETRTQTWMSAEGAWRQTEKVETWALRHPCFTVIALFLAHAIGTSITQKGIIFILL prM/full MLVTPSMAMRCVGIGNRDFVEGLSGATWVDVVLEHGGCVTTMAKNKPTLDFELIKTTA length E) KEVALLRTYCIEASISNITTATRCPTQGEPYLKEEQDQQYICRRDVTDRGWGNGCGLF GKCGVVTCAKFSCSGKITGNLVQIENLEYTVVVTVHNGDTHAVGNDTSNHGVTATITP RSPSVEVKLPDYGELTLDCEPRSGIDFNEMILMKMKKKTWLVHKQWFLDLPLPWTAGA DTSEVHWNHKERMVTFKVPHAKRQDVTVLGSQEGAMHSALAGATEVDSGDGNHMFAGH LKCKVRM*KLRIKGMSYTMCSGKFSIDKEMAETQHGTTVVKVKYEGTGAPCKVPIEIR DVNKEKVVGRIISSTPLAENTNSVTNIELEPPFGDSYIVIGIGDKALKINWFRKGSSI GKMFEATARGARRMAILGTAWDFGSVGGLLTSLGKAVIIQVFGSVYTTMFGGVSWMVR ILIGFLVLWIGTNSRNTSMAMSCIAVGGITLFLGFTVHA 245. 28D11 MRSTITLLCLIPTVMAFSLSTRDGEPLMIVGKNERGKSLLFKTASGINMCTLIAMDLG (C15/full EMCEDTVTYKCPLLVNTEPEDIDCWCNLTSTWVMYGTCTQAGEHRRDKRSVALAPHVG length MGLDTRTQTWMSSECAWKQIQKVETWALRHPGFTVIALFLAHAIGTSITQKGIIFILL prM/full MLVTPSMAMRCVGIGNRDFVEGLSGATWVDVVLEHGScVTTMAKNKPTLDIELQKTEA length E) TQLATLRKLCIEGKISNTTTDSRCPTQGEATLPEEQDQNYVCKHTYVDRGWGNGCGLF GKGSLVTCAKFQCLEPIEGKVVQHENLKYTVIITVHTGDQHQVGNDTQGVTVEITPQA STVEAILPEYGTLCLEcSPRTGLDFNEMILLTMKNKAWMVHRQWFFDLPLPWTSGATT ETPTWNRKELLVTFKNAHAKKQEVVVLGSQEGAMHTALTGATEIQTSGTTTIFAGHLK CRLKMDKLTLKGVSYVMCTGSFKLEKEVAETQHGTVLVQVKYEGTDAPCKIPFSSQDE KGVTQNGRLITANPVVTKKEEPVNIEAEPPFGESNIVIGIGDSALTLHWFRKGSSIGK MFESTYRGAKRMAILGETAWDFGSVGGLLTSLCKAVHQVFGSVYTTMFGGVSWMVRIL IGFLVLWIGTNSRNTSMAIASCIAVCVITLYLGAMVQA 246. 28E12 MKTSLCLMMMLPATLAFHLTSRDGEPRMIVGKNERGKSLLFKTTEGINKCTLIAMDLG (C15/full EMCEDTVTYKCPLLVNTEPEDIDCWCNLTSTWVTYGTCTTTCEHRREKRSVALDPHVG length LGLETRTETWMSSEGAWRQVEKVETWALRHPGFTILALFLAHYIGTSLTQKVVIFILL prM/full MLVTPSMTMRCVGAGNRDFVEGLSGATWVDVVLEHGSCVTTMAKNKPTLDFELIKTEA length E) TQPATLRKYCIEAKITNITTDSRCPTQGEAILNEEQDKRFVCKHSMVDRGWGIGCGLF GKGGVVTCAMFQCLEPIEGKVVQHENLKYTVIITVHTGDQHQVGNDTQGVTVEITPQA STVEAILPEYGTLGLEcSPRTGLDFNEMILLTMKNKAWMVHRQWFFDLPLPWTSGATT ETPTWNRKELLVTFKNAHAKQEVVVRGSQEGAMHSALAGATEVDSGDGNIIMFAGHLK CRLRMDKLQLKGMSYSMCTGKFQIVKEIAETQHGTIVIRVQYEGDGSPCKIPLEIMDL EKRFIVLGRLITVNPIVTEKDSPVNIEAEPPFGDSYIIIGVGDSAITLHWRKGSSIGK MFESTYRGARRMAILGDTAWDFGSIGGVFTSVGKLIHQIFGTAYGVLFSGVSWTMKIG IGILLTWLGMNSRSTSLSVSLVLVGVITLYLGAMVQA 247. 28F9 MNRRRRTVGVIIMLIPTAMAFHLTTRNGEPRMIVAKHERGRPLLFKTTEGINKCTLIA (C15/full MDLGEMCEDTVTYKCPLLVNTEPEDIDCWCNLTSTWVTYGTCSQTGEHRREKRSVALD length PHVGLGLETRTETWMNSEGAWKQIQKVETWALRHPGFTVTALFLAHAIGTSITQKGII prM/full FILLMLVAPSYGMRCVGVGNRDFVEGVSGGAWVDLVLEHGGCVTTMAKNKPTLDFELI length E) KTEATQPATLRKYcTEAKLTNTTTDSRCPTQGEPSLKEEQDQQYICRRDVVDRGWGNG CGLFGKGSLVTCAKFQCLEPIEGKVVQHENLKYTVIITVITGDQHQVGNDTSNHGVTA TITPRSPSVETKLPDYGELTLDCEPRSGIDFNEMILHKMKKKTWLVHKQWFLDLPLPW TAGADTSEVHWNHKERIVTFKVPHAKRQDVTVLGSQEGAMHSALAGATEVDSGDGNHM FAGHLKCKVRIEKLRIKGMSYTMCSGKFSIDKEMAETQHGTTVVKVKYEGTGAPCKVP IEIRDVNKEKVVGRIISSTPLAENTNSVTNIELEPPFGDSYIVIGVGDSALTLHWFKK GSSIGKMFEATARGARRMAILGETAWDFGSVGGLLTSLGKAVHQVPGSVYTTMFCGVS WMVRILIGFLVLWIGTNSRNTSMAMScTAVGGITLFLGFTVHA 248. 28H3 MRPTITLLCLMMMLPATLAFHLTTRNGEPHMIVGRQEKGKSLLFKTEDGVNMCTLMAI (c15/full DLGELCEDTITYKCPLLRQNEPEDIDCWCNSTSTWVTYGTCTTTGEHRREKRSVALVP length HVGMGLETRTETWMSSEGAWKHAQRVESWTLPNPRFALLAGFMAYMIGQTGIQRTVFF prM/full VLMMLVAPSYGMPCVGVGNRDFVEGLSGATWVDVVLEHGScVTTMAKNKPTLDIELLK length E) TEVTNPAVLRKYcTEAKLTNTTTESRCPTQGEAILPEEQDQNYVCKHTYVDRGWGNGC GLFGKGSLVTCAKFQCLEPTEGVVQHENLKYTVIITVIITGDQHQVGNDTQGVTVEIT PQASTVEAILPEYGTLGLEcSPRTCLDFNRVVLLTMKNKAWMVHRQWFFDLPLPWTSG ATTETPTWNRKELLVTFKNAHAKKQEVVVLGSQEGAMHTALTGATETQMSSGNLLFTG HLKCRLRMDKLQLKGMSYTMCSGKFQIVKEIAETQHGTIVIRVQYEGDGSPCKIPLEI MDLEKRHVLGRLITVNPIVTDKEKPVNIEAEPPFGDSYTIIGVEPGQLKLHWFKKGSS IGQMFETTMRGAKRMAILGDTAWDFGSLGGVPTSIGKALHQVFGAIYGAAFSGVSWTM KILIGVIITWIGMNSRSTSLSVSLVLVGVTTLYLGAMVQA 249. 28H9 MRSTTTLLCLIPTVMAFSLSTRDGEPLMTVAKHERGRPLLFKTTEGTNKCTLIAMDLG (C15/full EMCEDTVTYKCPLLVNTEPEDIDCWCNLTSTWVMYGTCTQSGERRREKRSVALVPHVG length MGLETRTETWMSAEGAWRQVEKVETWALRHPGFTILALFLARYTGTSLTQKVVIFILL prM/full MLVTPSMTMRCVGVGNRDFVEGVSGGSWVDTVLEHGScVTTMAKNKPTLDFELIKTEA length E) TQPATLRKYcTEAKLTNTTTESRCPTQGEPSLNEEQDKRFVCKHSMVDRGWGNGCGLF GKGSLVTCAKFQCLEPIEGKVVQHENLKYTVITTVHTGDQHQVGNDTQGVTVEITPQA STVEAILPEYGTLGLEcSPRTGLDFNEMILLTMKNKAWMVHRQWFFDLPLPWTAGADT SEVHWNHKERMVTFKVPHAKRQDVTVLGSQEGAMHTALTGTTETQNSGGTSIFAGHLK CRLKMDKLELKGMSYAMCLNTFVLKKEVSETQHGTILIKVEYKGEDAPCKIPFSTEDG QGKAHNGRLITANPVVTKKEEPVNIEAEPPFGESNIVIGIGDKALKINWYKKGSSIGK MFESTYRGAKRMAILGETAWDFGSVGGVLNSLGKMVHQIFGSAYTALFSGVSWTMKIL IGVIITWIGMNSRSTSLSVSLVLVGVTTLYLGAVQA 250. 16G11-D4 MRSTITLLCLTPTVMAFSLSTRDGEPLMTVAKHERGRPLLFKTTEGINKCTLIAMDLG (16G11) EMCEDTVTYKCPLLVNTEPEDIDCWCNLTSTWVMYGTCTQSGERRREKRSVALTPHSG extended MGLETRAETWMSSEGAWKHAQRVESWILRNPRFALLAGFMAYMIGQTGIQRTVFFVLM to MLVAPSYGMRCVCVGNRDFVEGVSGATWVDVVLEHGGCVTTMAKNKPTLDIELLKTEV C15/full TNPAVLRKLCIEASISNTTTDSRCPTQGEATLVEEQDTNFVCRRTFTDRGWGNGCGLF length GKGSVVTCAKFQCLEPIEGKVVQHENLKYTVIITTITGDQHQVGNDTQGVTVEITPQA prM/ STVEAILPEYGTLGLEcSPRTGLDFNEMILLTMKNKAWLTIKQWFLDLPLPWTAGADT full SEVHWNHKERMVTFKTAHAKKQDVVVLGSQEGANHTALTGATEIQMSSGNLLFTGHLK length E CRLRMDKLQLKGMSYVMCTGKFQIVKEIAETQHGTIVIRVQYEGDGAPCKIPFSTEDG with WT QGKAHNGRLITANPIVIDKEKPVNIELEPPPGDSYTVVGAcDKALKINWYKKGSSICK Den-4 E MFEATARGARRMAILGETAWDFGSVGGLLTSLGKAVHQVFGSVYTTMFGGVSWMVRIL protein IGFLTLWIGTNSRNTSMAMScTAVGGITLFLGFTVHA residues) 251. 16G11- MRSVTMILMLLPTTLAFHLTSRDGEPRMIVAKHERGRPLLFKTTEGINKCTLIAMDLG 25B10ext EMCDDTVTYKCPHITEVEPEDTDCWCNSTSTWVTYGTCNQAGEHRRDKRSVALAPHVG (16G11 MGLDTRTQTWMSAEGAWKHAQRIETWILRHPGFIIMAAILAYTIcTTHFQRALIFILL extended TAAAPSMTMRCIGISNRDFVEGVSGATWVDVVLEHGGCVTTMAKNKPTLDIELLKTEV to TNPAVLRKLCIEASISNTTTDSRCPTQGEATLTEEQDTNFVCRRTFVDRGWGNGCGLF C15/full GKGSVVTCAKFQCLEPIEGKVVQHENLKYTVIITVHTGDQHQVGNDTQGVTVEITPQA length STVEAILPEYGTLGLEcSPRTGLDFNEMILLTMKNKAWLVHKQWFLDLPLPWTAGADT prM/ SEVHWNHKERIVTFKTAHAKKQDVVVLGSQEGAMHTALTGATEIQMSSGNLLFTGHLK full CRLRMDKLQLKGMSYVMCTGKFQIVKEIAETQHGTIVIRVQYEGDGAPCKIPFSTEDG length E QGKAHNGRLITANPIVIDKEKPVNIELEPPFGDSYIVVGAGDKALKINWYKKGSSIGK with MFEATARGARPIAILGDTAWDFGSIGGVFTSIGKALHQVPGSVYTTMFGGVSWMVRIL 25B10 IGFLVLWTGTNSRSTSLSMTCIAVGMVTLYLGVMVQA residues of E protein) 252. 18H6-D4 MRSTITLLCLIPTVMAFSLSTRDGEPLMIVAKIERGRPLLFKTTEGINKCTLIAMDLG (18H6) EMCEDTVTYKCPLLVNTEPEDIDCWCNLTSTWVMYGTCTQSGERRREKRSVALTPHSG extended MGLETRAETWMSSEGAWKHAQRVESWILRNPRFALLAGFMAYMIGQTGIQRTVFFVLM to MLVAPSYGMPCVGVGNRDFVEGVSGATWVDVVLEHGGCVTTMAKNKPTLDFELIKTTA C15/full KEVALLRTYCIEASISNITTATRCPTQGEATLPEEQDQNYVCKHTYVDRGWGNGCGLF length GKGSLVTCAKFTCKKNMEGNIVQPENLEYTTVTTPHTGDQHQVGNDTQGVTVEITPQA prM/ STVEAILPEYGTLGLEcSPRTGLDFNRVVLLTMKKKSWLVHKQWFLDLPLPWTAGADT full SEVHWNHKERIVTFKNAHAKRQDVTVLGSQEGAIHSALAGATETQMSSGNLLFTGHLK length E CRLKNDKLQLKGVSYVMCTGSFKLEKEVAETQHGTVLVQVKYEGTDAPCKIPFSTEDG with WT QGKAHNGRLITVNPTVTDKEKPVNTELEPPFGESYIVVGAGEKALKLSWFKKGSSIGK Den-4 E MFEATARGAKRMAILGETAWDFGSVGGLLTSLGKAVHQVFGSVYTTMFGGVSWMVRIL protein IGFLVLWIGTNSPNTSMAMSCIAVGGITLFLGFTVHA residues) 253. 18H6- MRSVTMILMLLPTTLAFHLTSRDGEPRMTVAKHERGRPLLFKTTEGINKCTLIAMDLG 25B10ext EMCDDTVTYKCPHTTEVEPEDTDCWCNSTSTWVTYGTCNQAGEHRRDKRSVALAPHVG (18H6 MGLDTRTQTWMSAEGAWKHAQRIETWTLRHPGFIIMAAILAYTTGTTHFQPALIFILL extended TAAAPSMTMRcIGTSNRDFVEGVSGATWVDVVLEHGGCVTTMAKNKPTLDFELIKTTA to KEVALLRTYcTEASISNITTATRCPTQGEAILPEEQDQNYVCKHTYVDRGWGNGCGLF C15/full GKGSLVTCAKFTCKKNMEGNIVQPENLEYTIVITPHTGDQHQVGNDTQGVTVEITPQA length E STVEATLPEYGTLGLEcSPRTGLDFNRVVLLTMKKKSWLVHKQWFLDLPLPWTAGADT prM/ SEVHWNHKEPIVTFKNAHAKRDVTVLGSQEGAIMHSALACATEIQMSSGNLLFTGHLK full CRLKMDKLQLKGVSYVMCTGSFKLEKEVAETQHGTVLVQVKYEcTDAPCKTPFSTEDG length E QGKAHNGRLTTVNPTVIDKEKPVNIELEPPFGESYTVVGAGEKALKLSWFKKGSSIGK with MFEATARGAKRMATLGDTAWDFGSIGGVFTSIGKALHQVFGSVYTTMFGGVSWMVRIL 25E10 IGFLVLWIGTNSRSTSLSMTCIAVGMVTLYLGVMVQA residues of E protein) 254. >16G11-D4 atgagatctaccatcaccctgctctgcctgatccctactgttatggccttctctctgt (16G11 caacaagagatggcgagcctctcatgatcgtggccaagcacgaaagagggaggcctct extended gctgttcaagactactgaagggatcaataagtgcactctcatcgccatggacctgggc to gagatgtgtgaggataccgtgacctacaagtgtccactgctggtcaacaccgaacccg C15/full aggatatcgattgctggtgcaatctgacttctacttgggtgatgtatgggacctgtac length ccagtccggagagagaaggagggagaagaggtccgtcgccctgactcctcactctggt prM/ atgggcctggaaaccagagccgagacatggatgagctctgagggagcttggaagcacg full cccagagggttgaatcttggattctgcgcaatccacgctttgcactcctggctggctt length E catggcttatatgattgggcagactggtattcagaggaccgtcttcttcgttctgatg with WT atgctcgtggcaccatcttacggcatgaggtgcgtgggcgtgggcaacagggacttcg Den-4 E tggagggcgtgagcggcgccacctgggtggacgtggtgctggagcacggcggctgcgt protein gaccaccatggccaagaacaagcccaccctggacatcgagctgctgaagaccgaggtg residues) accaaccccgccgtgctgaggaagctgtgcatcgaggccagcatcagcaacaccacca ccgacagcaggtgccccacccagggcgaggccaccctggtggaggagcaggacaccaa cttcgtgtgcaggaggaccttcgtggacaggggctggggcaacggctgcggcctgttc ggcaagggcagcgtggtgacctgcgccaagttccagtgcctggagcccatcgagggca aggtggtgcagcacgagaacctgaagtacaccgtgatcatcaccgtgcacaccggcga ccagcaccaggtgggcaacgacacccagggcgtgaccgtggagatcaccccccaggcc agcaccgtggaggccatcctgcccgagtacggcaccctgggcctggagtgcagcccca ggaccggcctggacttcaacgagatgatcctgctgaccatgaagaacaaggcctggct ggtgcacaagcagtggttcctggacctgcccctgccctggaccgccggcgccgacacc agcgaggtgcactggaaccacaaggagaggatggtgaccttcaagaccgcccacgcca agaagcaggacgtggtggtgctgggcagccaggagggcgccatgcacaccgccctgac cggcgccaccgagatccagatgagcagcggcaacctgctgttcaccggccacctgaag tgcaggctgaggatggacaagctgcagctgaagggcatgagctacgtgatgtgcaccg gcaagttccagatcgtgaaggagatcgccgagacccagcacggcaccatcgtgatcag ggtgcagtacgagggcgacggcgccccctgcaagatccccttcagcaccgaggacggc cagggcaaggcccacaacggcaggctgatcaccgccaaccccatcgtgatcgacaagg agaagcccgtgaacatcgagctggagccccccttcggcgacagctacatcgtggtggg cgccggcgacaaggccctgaagatcaactggtacaagaagggcagcagcatcggcaag atgttcgaggccaccgccaggggcgccaggaggatggccatcctgggcgagaccgcct gggacttcggcagcgtgggcggcctgctgaccagcctgggcaaggccgtgcaccaggt gttcggcagcgtgtacaccacaatgtttggcggcgtctcttggatggtgagaatcctg atcgggttcctcgtcctgtggattggaaccaatagcaggaatacaagcatggccatga gctgcatcgctgttggcggcatcacactcttcctgggtttcaccgttcacgca 255. >16G11- atgagatctgtgaccatgattctcatgctgctgcctactactctggccttccatctga 25b10ext catcaagagatggcgagcctaggatgatcgtggccaagcacgaaagagggaggcctct (16G11 gctgttcaagactactgaagggatcaataagtgcactctcatcgccatggacctgggc extended gagatgtgtgacgataccgtgacctacaagtgtccacacatcaccgaggtcgaacccg to aggatatcgattgctggtgcaattcaacttctacttgggtgacctatgggacctgtaa C15/full ccaggctggcgagcataggagggataagaggtccgtcgccctggctcctcacgttggt length atgggcctggacaccagaacccagacatggatgagcgctgagggagcttggaagcacg prM/ cccagaggatcgagacctggattctgcgccatccaggctttatcatcatggctgccat full cctggcttatacaattgggaccactcacttccagagagccctcatcttcattctgctc length E accgccgcggcaccatctatgaccatgaggtgcatcggcatcagcaacagggacttcg with tggagggcgtgagcggcgccacctgggtggacgtggtgctggagcacggcggctgcgt 25B10 E gaccaccatggccaagaacaagcccaccctggacatcgagctgctgaagaccgaggtg protein accaaccccgccgtgctgaggaagctgtgcatcgaggccagcatcagcaacaccacca residues) ccgacagcaggtgccccacccagggcgaggccaccctggtggaggagcaggacaccaa cttcgtgtgcaggaggaccttcgtggacaggggctggggcaacggctgcggcctgttc ggcaagggcagcgtggtgacctgcgccaagttccagtgcctggagcccatcgagggca aggtggtgcagcacgagaacctgaagtacaccgtgatcatcaccgtgcacaccggcga ccagcaccaggtgggcaacgacacccagggcgtgaccgtggagatcaccccccaggcc agcaccgtggaggccatcctgcccgagtacggcaccctgggcctggagtgcagcccca ggaccggcctggacttcaacgagatgatcctgctgaccatgaagaacaaggcctggct ggtgcacaagcagtggttcctggacctgcccctgccctggaccgccggcgccgacacc agcgaggtgcactggaaccacaaggagaggatggtgaccttcaagaccgcccacgcca agaagcaggacgtggtggtgctgggcagccaggagggcgccatgcacaccgccctgac cggcgccaccgagatccagatgagcagcggcaacctgctgttcaccggccacctgaag tgcaggctgaggatggacaagctgcagctgaagggcatgagctacgtgatgtgcaccg gcaagttccagatcgtgaaggagatcgccgagacccagcacggcaccatcgtgatcag ggtgcagtacgagggcgacggcgccccctgcaagatccccttcagcaccgaggacggc cagggcaaggcccacaacggcaggctgatcaccgccaaccccatcgtgatcgacaagg agaagcccgtgaacatcgagctggagccccccttcggcgacagctacatcgtggtggg cgccggcgacaaggccctgaagatcaactggtacaagaagggcagcagcatcggcaag atgttcgaggccaccgccaggggcgccaggaggatggccatcctcggcgatacagcct gggattttggctcaatcggaggggtgttcaccagcatcggcaaggccctgcaccaggt tttcggcagcgtgtacaccacaatgtttggcggcgtctcttggatggtgagaatcctg atcgggttcctcgtcctgtggattggaaccaatagcaggagcacaagcctgagcatga cctgcatcgctgttggcatggtcacactctacctgggtgtgatggttcaggca 256. >18H6-D4 atgagatctaccatcaccctgctctgcctgatccctactgttatggccttctctctgt (18H6) caacaagagatggcgagcctctcatgatcgtggccaagcacgaaagagggaggcctct extended gctgttcaagactactgaagggatcaataagtgcactctcatcgccatggacctgggc to gagatgtgtgaggataccgtgacctacaagtgtccactgctggtcaacaccgaacccg C15/full aggatatcgattgctggtgcaatctgacttctacttgggtgatgtatgggacctgtac length ccagtccggagagagaaggagggagaagaggtccgtcgccctgactcctcactctggt prM/ atgggcctggaaaccagagccgagacatggatgagctctgagggagcttggaagcacg full cccagagggttgaatcttggattctgcgcaatccacgctttgcactcctggctggctt length E catggcttatatgattgggcagactggtattcagaggaccgtcttcttcgttctgatg with WT atgctcgtggcaccatcttacggcatgaggtgcgtgggcgtgggcaacagggacttcg Den-4 E tggagggcgtgagcggcgccacctgggtggacgtggtgctggagcacggcggctgcgt protein gaccaccatggccaagaacaagcccaccctggacttcgagctgatcaagaccaccgcc residues) aaggaggtggccctgctgaggacctactgcatcgaggccagcatcagcaacatcacca ccgccaccaggtgccccacccagggcgaggccatcctgcccgaggagcaggaccagaa ctacgtgtgcaagcacacctacgtggacaggggctggggcaacggctgcggcctgttc ggcaagggcagcctggtgacctgcgccaagttcacctgcaagaagaacatggagggca acatcgtgcagcccgagaacctggagtacaccatcgtgatcaccccccacaccggcga ccagcaccaggtgggcaacgacacccagggcgtgaccgtggagatcaccccccaggcc agcaccgtggaggccatcctgcccgagtacggcaccctgggcctggagtgcagcccca ggactggcctggacttcaacagggtggtgctgctgaccatgaagaagaagagctggct ggtgcacaagcagtggttcctggacctgcccctgccctggaccgccggcgccgacacc agcgaggtgcactggaaccacaaggagaggatggtgaccttcaagaacgcccacgcca agaggcaggacgtgaccgtgctgggcagccaggagggcgccatgcacagcgccctggc cggcgccaccgagatccagatgagcagcggcaacctgctgttcaccggccacctgaag tgcaggctgaagatggacaagctgcagctgaagggcgtgagctacgtgatgtgcaccg gcagcttcaagctggagaaggaggtggccgagacccagcacggcaccgtgctggtgca ggtgaagtacgagggcaccgacgccccctgcaagatccccttcagcaccgaggacggc cagggcaaggcccacaacggcaggctgatcaccgtgaaccccatcgtgatcgacaagg agaagcccgtgaacatcgagctggagccccccttcggcgagagctacatcgtggtggg cgccggcgagaaggccctgaagctgagctggttcaagaagggcagcagcatcggcaag atcgggttcctcgtcctgtggattggaaccaatagcaggaatacaagcatggccatga gggacttcggcagcgtgggcggcctgctgaccagcctgggcaaggccgtgcaccaggt gttcggcagcgtgtacaccacaatgtttggcggcgtctcttggatggtgagaatcctg atcgggttcctcgtcctgtggattggaaccaatagcaggaatacaagcatggccatga gctgcatcgctgttggcggcatcacactcttcctgggtttcaccgttcacgca 257. >18H6- atgagatctgtgaccatgattctcatgctgctgcctactactctggccttccatctga 25B10ext catcaagagatggcgagcctaggatgatcgtggccaagcacgaaagagggaggcctct (18H6 gctgttcaagactactgaagggatcaataagtgcactctcatcgccatggacctgggc extended gagatgtgtgacgataccgtgacctacaagtgtccacacatcaccgaggtcgaacccg to aggatatcgattgctggtgcaattcaacttctacttgggtgacctatgggacctgtaa C15/full ccaggctggcgagcataggagggataagaggtccgtcgccctggctcctcacgttggt length atgggcctggacaccagaacccagacatggatgagcgctgagggagcttggaagcacg prM/ cccagaggatcgagacctggattctgcgccatccaggctttatcatcatggctgccat full cctggcttatacaattgggaccactcacttccagagagccctcatcttcattctgctc length E accgccgcggcaccatctatgaccatgaggtgcatcggcatcagcaacagggacttcg with tggagggcgtgagcggcgccacctgggtggacgtggtgctggagcacggcggctgcgt 25B10 gaccaccatggccaagaacaagcccaccctggacttcgagctgatcaagaccaccgcc residues aaggaggtggccctgctgaggacctactgcatcgaggccagcatcagcaacatcacca of E ccgccaccaggtgccccacccagggcgaggccatcctgcccgaggagcaggaccagaa protein) ctacgtgtgcaagcacacctacgtggacaggggctggggcaacggctgcggcctgttc ggcaagggcagcctggtgacctgcgccaagttcacctgcaagaagaacatggagggca acatcgtgcagcccgagaacctggagtacaccatcgtgatcaccccccacaccggcga ccagcaccaggtgggcaacgacacccagggcgtgaccgtggagatcaccccccaggcc agcaccgtggaggccatcctgcccgagtacggcaccctgggcctggagtgcagcccca ggactggcctggacttcaacagggtggtgctgctgaccatgaagaagaagagctggct ggtgcacaagcagtggttcctggacctgcccctgccctggaccgccggcgccgacacc agcgaggtgcactggaaccacaaggagaggatggtgaccttcaagaacgcccacgcca agaggcaggacgtgaccgtgctgggcagccaggagggcgccatgcacagcgccctggc cggcgccaccgagatccagatgagcagcggcaacctgctgttcaccggccacctgaag tgcaggctgaagatggacaagctgcagctgaagggcgtgagctacgtgatgtgcaccg gcagcttcaagctggagaaggaggtggccgagacccagcacggcaccgtgctggtgca ggtgaagtacgagggcaccgacgccccctgcaagatccccttcagcaccgaggacggc cagggcaaggcccacaacggcaggctgatcaccgtgaaccccatcgtgatcgacaagg agaagcccgtgaacatcgagctggagccccccttcggcgagagctacatcgtggtggg cgccggcgagaaggccctgaagctgagctggttcaagaagggcagcagcatcggcaag atgttcgaggccaccgccaggggcgccaagaggatggccatcctcggcgatacagcct gggattttggctcaatcggaggggtgttcaccagcatcggcaaggccctgcaccaggt tttcggcagcgtgtacaccacaatgtttggcggcgtctcttggatggtgagaatcctg atcgggttcctcgtcctgtggattggaaccaatagcaggagcacaagcctgagcatga cctgcatcgctgttggcatggtcacactctacctgggtgtgatggttcaggca 258. >25B6 atgagatctaccatcaccctgctctgcctgatccctactgttatggccttctctctgt (C15/full caacaagagatggcgagcctctcatgatcgtggccaagcacgaaagagggaggcctct length gctgttcaagactactgaagggatcaataagtgcactctcatcgccatggacctgggc prM/full gagctgtgtgaggataccatcacctacaagtgtccactgctgaggcaggacgaacccg length E) aggatatcgattgctggtgcaatgcaactgatacttgggtgacctatgggacctgtaa ccaggctggcgagcataggagggataagaggtccgtcgccctgactcctcactctggt atgggcctggaaaccagagccgagacatggatgagctctgagggagcttggaagcaga tccagaaggttgagacttgggctctgcgccatccaggctttacagtcattgccctgtt cctggctcatgcaattgggacctctattactcagaagggcatcatcttcattctgctc atgctcgtgacaccatctatggccatgaggtgcgtgggcgtgggcaacagggacttcg tggagggcctgagcggcgccacctgggtggacgtggtgctggagcacggcggctgcgt gaccaccatggccaagaacaagcccaccctggacatcgagctgcagaagaccgaggcc acccagctggccaccctgaggaagctgtgcatcgagggcaagatcaccaacatcacca ccgacagcaggtgccccacccagggcgaggccatcctgcccgaggagcaggaccagca gtacatctgcaggagggacgtggtggacaggggctggggcaacggctgcggcctgttc ggcaagggcagcctggtgacctgcgccaagttccagtgcctggagcccatcgagggca aggtggtgcagcacgagaacctgaagtacaccgtgatcatcaccgtgcacaccggcga ccagcaccaggtgggcaacgacacccagggcgtgaccgtggagatcaccccccaggcc agcaccgtggaggccatcctgcccgagtacggcaccctgggcctggagtgcagcccca ggaccggcctggacttcaacgagatgatcctgctgaccatgaagaacaaggcctggat ggtgcacaagcagtggttcctggacctgcccctgccctggaccagcggcgccagcacc agccaggagacctggaacaggcaggacctgctggtgaccttcaagaccgcccacgcca agaagcaggaggtggtggtgctgggcagccaggagggcgccatgcacaccgccctgac cggcgccaccgagatccagatgagcagcggcaacctgctgttcaccggccacctgaag tgcaggctgaagatggacaagctggagctgaagggcatgagctacgccatgtgcctga acaccttcgtgctgaagaaggaggtggccgagacccagcacggcaccgtgctggtgca ggtgaagtacgagggcaccgacgccccctgcaagatccccttcagcagccaggacgag aagggcgtgacccagaacggcaggctgatcaccgccaaccccatcgtgatcgacaagg agaagcccgtgaacatcgaggccgagccccccttcggcgagagctacatcgtgatcgg catcggcgacaaggccctgaagatcaactggtacaagaagggcagcagcatcggcaag atgttcgaggccaccgccaggggcgccaggaggatggccatcctcggcgatacagcct gggattttggctcaatcggaggggtgttcaccagcgtgggtaaactgattcatcagat tttcggtacagcatatggcgtgctgttttccggcgtctcttggaccatgaagatcggt attggcatcctcctcacatggctgggactgaatagcaggagcacaagcatgagcttta gctgcatcgctattggcatcatcacactctacctgggtgtggtggttcaggca 259. >25B10 atgagatctgtgaccatgattctcatgctgctgcctactactctggccttccatctga (C15/full catcaagagatggcgagcctaggatgatcgtggccaagcacgaaagagggaggcctct length gctgttcaagactactgaagggatcaataagtgcactctcatcgccatggacctgggc prM/full gagatgtgtgacgataccgtgacctacaagtgtccacacatcaccgaggtcgaacccg length E) aggatatcgattgctggtgcaattcaacttctacttgggtgacctatgggacctgtaa ccaggctggcgagcataggagggataagaggtccgtcgccctggctcctcacgttggt atgggcctggacaccagaacccagacatggatgagcgctgagggagcttggaagcacg cccagaggatcgagacctggattctgcgccatccaggctttatcatcatggctgccat cctggcttatacaattgggaccactcacttccagagagccctcatcttcattctgctc accgccgcggcaccatctatgaccatgaggtgcatcggcatcagcaacagggacttcg tggagggcgtgagcggcggcagctgggtggacatcgtgctggagcacggcagctgcgt gaccaccatggccaagaacaagcccaccctggacatcgagctgctgaagaccgaggtg accaaccccgccgtgctgaggaagctgtgcatcgaggccaagatcagcaacaccacca ccgacagcaggtgccccacccagggcgaggccaccctgcccgaggagcaggaccagaa ctacgtgtgcaagcacacctacgtggacaggggctggggcaacggctgcggcctgttc ggcaagggcagcctggtgacctgcgccaagttccagtgcctggagcccatcgagggca aggtggtgcagcacgagaacctgaagtacaccgtgatcatcaccgtgcacaccggcga ccagcaccaggtgggcaacgacacccagggcgtgaccgtggagatcaccccccaggcc agcaccgtggaggccatcctgcccgagtacggcaccctgggcctggagtgcagcccca ggaccggcctggacttcaacgagatgatcctgctgaccatgaagaacaaggcctggat ggtgcacaggcagtggttcttcgacctgcccctgccctggaccagcggcgccaccacc gagacccccacctggaacaggaaggagctgctggtgaccttcaagaccgcccacgcca agaagcaggaggtggtggtgctgggcagccaggagggcgccatgcacagcgccctggc cggcgccaccgaggtccagatgagcagcggcaacctgctgttcaccggccacctgaag tgcaggctgaggatggacaagctgcagctgaagggcatgagctacagcatgtgcaccg gcaagttccagatcgtgaaggagatcgccgagacccagcacggcaccatcgtgatcag ggtggagtacaagggcgaggacgccccctgcaagatccccttcagcaccgaggacggc cagggcaaggcccacaacggcaggctgatcaccgccaaccccgtggtgaccaagaagg aggagcccgtgaacatcgaggccgagccccccttcggcaagagcaacatcgtgatcgg catcggcgacaaggccctgaagatcaactggtacaagaagggcagcagcatcggcaag atgttcgaggccaccgccaggggcgccaggaggatggccatcctcggcgatacagcct gggattttggctcaatcggaggggtgttcaccagcatcggcaaggccctgcaccaggt tttcggcagcgtgtacaccacaatgtttggcggcgtctcttggatggtgagaatcctg atcgggttcctcgtcctgtggattggaaccaatagcaggagcacaagcctgagcatga cctgcatcgctgttggcatggtcacactctacctgggtgtgatggttcaggca 260. 25D4 atgagatctgtgaccatgattctcatgctgctgcctactgctctggccttccatctga (C15/full caacaagaggtggcgagcctaccctgatcgtgtccaagcaagaacgcggcaagagcct length gctgttcaagacttctgcagggatcaatatgtgcactctcatcgccatggacctgggc prM/full gagatgtgtgacgataccgtgacctacaagtgtccacacatcaccgaggtcgaacccg length E) aggatatcgattgctggtgcaatctgacttctacttgggtgacctatgggacctgtaa ccaggctggcgagcataggagggataagaggtccgtcgccctggttcctcacgttggt atgggcctggaaaccagaaccgagacatggatgagctctgagggagcttggaagcacg cccagaggatcgagacctggattctgcgccatccaggctttacaatcctggccctgtt cctggctcattacattgggacctctctgactcagaaggtcgtcatcttcattctgctc atgctcgtgacaccatctatgaccatgaggtgcatcggcatcagcaacagggacttcg tggagggcctgagcggcgccacctgggtggacgtggtgctggagcacggcagctgcgt gaccaccatggccaagaacaagcccaccctggacttcgagctgatcaagaccgaggcc acccagcccgccaccctgaggaagtactgcatcgaggccaagctgaccaacatcacca ccgacagcaggtgccccacccagggcgaggccatcctgcccgaggagcaggaccagaa ctacgtgtgcaagcacacctacgtggacaggggctggggcaacggctgcggcctgttc ggcaagggcagcctggtgacctgcgccaagttccagtgcctggagcccatcgagggca aggtggtgcagcacgagaacctgaagtacaccgtggtggtgaccgtgcacaacggcga cacccacgccgtgggcaacgacaccagcaaccacggcgtgaccgccaccatcaccccc aggagccccagcgtggaggtgaagctgcccgactacggcgagctgaccctggactgcg agcccaggagcggcatcgacttcaacgagatgatcctgatgaagatgaagaagaagac ctggctggtgcacaagcagtggttcctggacctgcccctgccctggaccgccggcgcc gacaccagcgaggtgcactggaaccacaaggagaggatggtgaccttcaaggtgcccc acgccaagaggcaggacgtgaccgtgctgggcagccaggagggcgccatgcacaccgc cctgaccggcgccaccgagatccagatgagcagcggcaaccacatgttcgccggccac ctgaagtgcaggctgaggatggacaagctgcagctgaagggcatgagctacagcatgt gcaccggcaagttccagatcgtgaaggagatcgccgagacccagcacggcaccatcgt gatcagggtgcagtacgagggcgacggcagcccctgcaagatccccctggagatcatg gacctggagaagaggcacgtgctgggcaggctgatcaccgccaaccccgtggtgacca agaaggacagccccgtgaacatcgaggccgagccccccttcggcgacagctacatcat catcggcgtggagcccggccagctgaagctgcactggttcaagaagggcagcagcatc ggccagatgttcgagagcacctacaggggcgccaagaggatggccatcctgggcgaga ccgcctgggacttcggcagcgtgggcggcctgctgaccagcctgggcaaggccgtgca ccaggtgttcggcagcgtgtacaccacaatgtttggcggcgtctcttggatggtgaga atcctgatcgggttcctcgtcctgtggattggaaccaatagcaggaatacaagcatgg ccatgagctgcatcgctgttggcggcatcacactcttcctgggtttcaccgttcaggc a 261. >25E11 atgaaaactagcctctgcctgatgatgatgctgcctgctactctggccttccatctga (C15/full) catcaagagatggcgagcctaggatgatcgtgggcaagaacgaaagagggaagagcct length gctgttcaagactgcttcagggatcaatatgtgcactctcatcgccatggacctgggc prM/full gagatgtgtgacgataccgtgacctacaagtgtccacacatcaccgaagtcgaacccg length E) aggatatcgattgctggtgcaatctgacttctacttgggtgacctatgggacctgtac ccagtccggagagagaaggagggagaagaggtccgtcgccctgactcctcactctggt atgggcctggaaaccagaaccgagacatggatgagctctgagggagcttggaagcaga tccagaaggttgagacttgggctctgcgccatccaggctttacagtcattgccctgtt cctggctcatgcaattgggacctctattactcagaagggcatcatcttcattctgctc atgctcgtgacaccatctatggccatgaggtgcgtgggcgtgggcaacagggacttcg tggagggcctgagcggcgccacctgggtggacatcgtgctggagcacggctcctgcgt gaccaccatggccaagaacaagcccaccctggacttcgagctgatcaagaccgaggcc acccagcccgccaccctgaggaagctgtgcatcgagggcaagatcaccaacatcacca ccgacagcaggtgccccacccagggcgaggccatcctgcccgaggagcaggaccagaa ctacgtgtgcaagcacacctacgtggacaggggctggggcaacggctgcggcctgttc ggcaagggcagcctggtgacctgcgccaagttccagtgcctggagcccatcgagggca aggtggtgcagcacgagaacctgaagtacaccgtgatcatcaccgtgcacaccggcga ccagcaccaggtgggcaacgacacccagggcgtgaccgtggagatcaccccccaggcc agcaccgtggaggccatcctgcccgagtacggcgccctgaccctggactgcagcccca ggaccggcctggacttcaacagggtggtgctgctgaccatgaagaagaagagctggct ggtgcacaagcagtggttcctggacctgcccctgccctggaccagcggcgccagcacc agccaggagacctggaacaggcaggacctgctggtgaccttcaagaccgcccacgcca agaagcaggaggtggtggtgctgggcagccaggagggcgccatgcacaccgccctgac cggcaccaccgagatccagaacagcggcggcaccagcatcttcgccggccacctgaag tgcaggctgaagatggacaagctgaccctgaagggcgtgagctacgtgatgtgcaccg gcagcttcaagctggagaaggaggtgagcgagacccagcacggcaccatcctgatcaa ggtggagtacaagggcgaggacgccccctgcaaggtgcccatcgagatcagggacgtg aacaaggagaaggtggtgggcaggatcatcagcagcacccccctggccgagaacacca acagcgtgaccaacatcgagctggagccccccttcggcgacagctacatcgtgatcgg cgtgggcgacagcgccctgaccctgcactggttcaggaagggcagcagcatcggcaag atgttcgaggccaccgccaggggcgccaggaggatggccatcctcggcgatacagcct gggattttggctcaatcggaggggtgttcaccagcgtgggtaaactgattcatcagat tttcggtacagcatatggcgtgctgttttccggcgtctcttggaccatgaagatcggt attggcatcctcctcacatggctgggactgaatagcaggagcacaagcctgagcatga cctgcatcgctgttggcatggtcacactctacctgggtgtgatggttcaggca 262. >25H4 atgagatctgtgaccatgattctcatgctgctgcctactgctctggccttccatctga (C15/full caacaagaggtggcgagcctctcatgatcgtggccaagcacgaaagagggaagagcct length gctgttcaagactgcttcagggatcaatatgtgcactctcatcgccatggacctgggc prM/full gagatgtgtgacgataccgtgacctacaagtgtccacggatgaccgaggtcgaacccg length E) aggatatcgattgctggtgcaatctgacttctacttgggtgacctatgggacctgtac caccacaggcgagcataggagggagaagaggtccgtcgccctggttcctcacgttggt atgggcctggaaaccagaaccgagacatggatgagctctgagggagcttggaagcaga tccagaaggttgagacttgggctctgcgccatccaggctttacaatcctggccctgtt cctggctcattacattgggacctctctgactcaggaggtcgtcatcttcattctgctc atgctcgtgacaccatctatggccatgaggtgcgtgggcatcggcaaccgcgacttcg tggagggcctgagcggcgccacctgggtggacgtggtgctggagcacggcagctgcgt gaccaccatggccaagaacaagcccaccctggacatcgagctgctgaagaccgaggtg accaaccccgccgtgctgaggaagctgtgcatcgaggccaagatcagcaacaccacca ccgacagcaggtgccccacccagggcgaggccatcctgcccgaggagcaggaccagaa ctacgtgtgcaagcacacctacgtggacaggggctggggcaacggctgcggcctgttc ggcaagggcagcctggtgacctgcgccaagttccagtgcctggagcccatcgagggca aggtggtgcagcacgagaacctgaagtacaccgtgatcgtgaccgtgcacaccggcga ccagcaccaggtgggcaacgacacccagggcgtgaccgtggagatcaccccccaggcc agcaccgtggaggccatcctgcccgagtacggcaccctgggcctggagtgcagcccca ggaccggcctggacttcaacgagatgatcctgctgaccatgaagaacaaggcctggat ggtgcacaagcagtggttcctggacctgcccctgccctggaccagcggcgccagcacc agccaggagacctggaacaggcaggacctgctggtgaccttcaagaccgcccacgcca agaagcaggacgtggtggtgctgggcagccaggagggcgccatgcacaccgccctgac cggcgccaccgagatccagatgagcagcggcaacctgctgttcaccggccacctgaag tgcaggctgaggatggacaagctgcagctgaagggcatgagctacagcatgtgcaccg gcaagttccagatcgtgaaggagatcgccgagacccagcacggcaccatcgtgatcag ggtgcagtacgagggcgacggcagcccctgcaagatccccctggagatcatggacctg gagaagaggcacgtgctgggcaggctgatcaccgtgaaccccatcgtgaccgagaagg acagccccgtgaacatcgaggccgagccccccttcggcgacagctacatcatcatcgg cgtggagcccggccagctgaagctgcactggttcaagaagggcagcagcatcggccag atgttcgagaccaccatgaggggcgccaagaggatggccatcctgggcgacaccgcct gggacttcggcagcctgggcggcgtgttcaccagcatcggcaaggccctgcaccaggt tttcggtgcaatctatggcgcagccttttccggcgtctcttggaccatgaagatcctg atcggggtcatcatcacatggattggaatgaatagcaggagcacaagcctgagcgtta gcctcgtcctggttggcgtcatcacactctacctgggtgccatggttcaggca 263. >27A11 atgaaaactagcctctgcctgatgatgatgctgcctactgctctggccttccatctga (C15/full caacaagaggtggcgagcctaccctgatcgtgtccaagcaagaacgcggcaagagcct length gctgttcaagacttctgcaggggtgaatatgtgcactctcatcgccatggacctgggc prM/full gagctgtgtgaggataccatgacctacaagtgtccacggatgaccgaggccgaacccg length E) acgatgtggattgctggtgcaatgcaactgatacttgggtgacctatgggacctgtag ccagacaggcgagcataggagggagaagaggtccgtcgccctggttcctcacgttggt atgggcctggaaaccagaaccgagacatggatgagctctgagggagcttggaagcaga tccagaaggttgagacttgggctctgcgccatccaggctttacagtcattgccctgtt cctggctcatgcaattgggacctctattactcagaagggcatcatcttcattctgctc atgctcgtgacaccatctatggccatgaggtgcgtgggcatcggcaaccgcgacttcg tggagggcgtgagcggcggcagctgggtggacatcgtgctggagcacggctcctgcgt gaccaccatggccaagaacaagcccaccctggacatcgagctgctgaagaccgaggtg accaaccccgccgtgctgaggaagctgtgcatcgaggccaagatcagcaacaccacca ccgacagcaggtgccccacccagggcgaggccatcctgcccgaggagcaggaccagaa ctacgtgtgcaagcacacctacgtggacaggggctggggcaacggctgcggcctgttc ggcaagggcagcctggtgacctgcgccaagttccagtgcctggagcccatcgagggca aggtggtgcagcacgagaacctgaagtacaccgtgatcatcaccgtgcacaccggcga ccagcaccaggtgggcaacgacacccagggcgtgaccgtggagatcaccccccaggcc caccgtggaggccatcctgcccgactacggcgagctgaccctggactgcgagccca ggagcggcatcgacttcaacgagatgatcctgctgaccatgaagaacaaggcctggat ggtgcacaggcagtggttcttcgacctgcccctgccctggaccagcggcgccaccacc gagacccccacctggaacaggaaggagctgctggtgaccttcaagaacgcccacgcca aagcaggaggtggtggtgctgggcagccaggagggcgccatgcacaccgccctgac cggcaccaccgagatccagaacagcggcggcaccagcatcttcgccggccacctgaag tgcaaggtgaggatggagaagctgcgcatcaagggcatgagctacaccatgtgcagcg gcaagttcagcatcgacaaggagatggccgagacccagcacggcaccaccgtggtgaa ggtgaagtacgagggcaccggcgccccctgcaaggtgcccatcgagatcagggacgtg aacaaggagaaggtggtgggcaggatcatcagcagcacccccctggccgagaacacca acagcgtgaccaacatcgagctggagccccccttcggcgacagctacatcgtgatcgg cgtgggcgacagcgccctgaccctgcactggttcaggaagggcagcagcatcggcaag atgttcgagagcacctacaggggcgccaagaggatggccatcctgggcgagaccgcct gggacttcggcagcgtgggcggcctgctgaccagcctgggcaaggccgtgcaccaggt gttcggcagcgtgtacaccacaatgtttggcggcgtctcttggatggtgagaatcctg atcgggttcctcgtcctgtggattggaaccaatagcaggaatacaagcatggccatga gctgcatcgctgttggcggcatcacactcttcctgggtttcaccgttcacgca 264. >27G6 atgagatctaccatcaccctgctctgcctgatccctactgttatggccttctctctgt (C15/full caacaagagatggcgagcctctcatgatcgtggccaagcacgaaagagggaggcctct length gctgttcaagactactgaagggatcaataagtgcactctcatcgccatggacctgggc prM/full gagatgtgtgaggataccgtgacctacaagtgtccactgctggtcaacaccgaacccg length E) aggatatcgattgctggtgcaattcaacttctacttgggtgatgtatgggacctgtag ccagacaggcgagcataggagggataagaggtccgtcgccctggttcctcacgttggt atgggcctggaaaccagaaccgagacatggatgagctctgagggagcttggaagcaga tccagaaggttgagacttgggctctgcgccatccaggctttacagtcattgccctgtt cctggctcattacattgggacctctctgactcagaaggtcgtcatcttcattctgctc atgctcgtgacaccatctatgaccatgaggtgcgtgggcatcggcaaccgcgacttcg tggagggcctgagcggcgccacctgggtggacgtggtgctggagcacggcagctgcgt gaccaccatggccaagaacaagcccaccctggacatcgagctgctgaagaccgaggtg accaaccccgccgtgctgaggaagctgtgcatcgaggccaagatcagcaacaccacca ccgacagcaggtgccccacccagggcgagccctacctgaaggaggagcaggaccagca gtacatctgcaggagggacgtggtggacaggggctggggcaacggctgcggcctgttc ggcaagggcagcctgatcacctgcgccaagttcaagtgcgtgaccaagctggagggca agatcgtgcagtacgagaacctggagtacaccatcgtgatcaccccccacagcggcga ggagcacgccgtgggcaacgacaccagcaaccacggcgtgaccgccaccatcaccccc aggagccccagcgtggaggtgaagctgcccggctacggcaccgtgaccatggagtgca gccccaggaccggcctggacttcaacagggtggtgctgctgaccatgaagaagaagag ctggctggtgcacaagcagtggttcctggacctgcccctgccctggaccagcggcgcc accaccgagacccccacctggaacaggaaggagctgctggtgaccttcaagaacgccc acgccaagaggcaggacgtgaccgtgctgggcagccaggagggcgccatgcacagcgc cctggccggcgccaccgaggtggacagcggcgacggcaaccacatgttcgccggccac ctgaagtgcaaggtgaggatggagaagctgaccctgaagggcgtgagctacgtgatgt gcaccggcagcttcaagctggagaaggaggtggccgagacccagcacggcaccgtgct ggtgcaggtgaagtacgagggcaccgacgccccctgcaagatccccttcagcagccag gacgagaagggcgtgacccagaacggcaggctgatcaccgccaaccccgtggtgacca aaggaggagcccgtgaacatcgaggccgagccccccttcggcgagagcaacatcgt gatcggcgtgggcgacagcgccctgaccctgcactggttcaggaagggcagcagcatc ggcaagatgttcgagagcacctacaggggcgccaggaggatggccatcctgggcgaca ccgcctgggacttcggcagcgtgggcggcctgctgaccagcctgggcaaggccgtgca ccaggtgttcggcagcgtgtacaccacaatgtttggcggcgtctcttggatggtgaga atcctgatcgggttcctcgtcctgtggattggaaccaatagcaggaatacaagcatgg ccatgagctgcatcgctgttggcggcatcacactcttcctgggtttcaccgttcacgc a 265. >28A11 atgagatctaccatcaccctgctctgcctgatccctactgttatggccttccatctga (C15/full caacaagaaatggcgagcctaggatgatcgtgggcaagaacgaaagagggaagagcct length gctgttcaagactgaggacggcgtgaatatgtgcactctcatggccatcgacctgggc prM/full gagctgtgtgaggataccatcacctacaagtgtccactgctggtcaacaccgaacccg length E) aggatatcgattgctggtgcaatctgacttctacttgggtgacctatgggacctgtaa ccaggctggcgagcataggagggataagaggtccgtcgccctggctcctcacgttggt atgggcctggaaaccagaaccgagacatggatgagctctgagggagcttggaagcaga tccagaaggttgagacttgggctctgcgccatccaggctttacaatcctggccctgtt cctggctcatgcaattgggacctctattactcagaagggcatcatcttcattctgctc atgctcgtgacaccatctatggccatgaggtgcgtgggcatcggcaaccgcgacttcg tggagggcctgagcggcgccacctgggtggacgtggtgctggagcacggcagctgcgt gaccaccatggccaagaacaagcccaccctggacatcgagctgctgaagaccgaggtg accaaccccgccgtgctgaggaagctgtgcatcgaggccaagatcaccaacatcacca ccgacagcaggtgccccacccagggcgaggccatcctgcccgaggagcaggaccagaa ctacgtgtgcaagcacagcatggtggacaggggctggggcaacggctgcggcctgttc ggcaagggcagcctggtgacctgcgccaagttccagtgcctggagcccatcgagggca aggtggtgcagcacgagaacctgaagtacaccgtgatcatcaccgtgcacaccggcga ccagcaccaggtgggcaacgacacccagggcgtgaccgtggagatcaccccccaggcc agcaccgtggaggccatcctgcccgagtacggcaccctgggcctggagtgcagcccca ggaccggcctggacttcaacgagatgatcctgatgaagatgaagaagaagacctggct ggtgcacaggcagtggttcctggacctgcccctgccctggaccagcggcgccagcacc agccaggagacctggaacaggcaggacctgctggtgaccttcaagaacgcccacgcca agaagcaggaggtggtggtgctgggcagccaggagggcgccatgcacaccgccctgac cggcgccaccgagatccagaccagcggcaccaccaccatcttcgccggccacctgaag tgcaaggtgaggatggagaagctgcgcatcaagggcatgagctacaccatgtgcagcg gcaagttcagcatcgacaaggagatggccgagacccagcacggcaccaccgtggtgaa ggtgaagtacgagggcaccggcgccccctgcaaggtgcccatcgagatcagggacgtg aacaaggagaaggtggtgggcaggatcatcagcagcacccccctggccgagaacacca acagcgtgaccaacatcgagctggagccccccttcggcgacagctacatcgtgatcgg cgtgggcgacagcgccctgaccctgcactggttcaggaagggcagcagcatcggcaag atgttcgagagcacctacaggggcgccaagaggatggccatcctgggcgagaccgcct gggacttcggcagcgtgggcggcctgctgaccagcctgggcaaggccgtgcaccaggt gttcggcagcgtgtacaccacaatgtttggcggcgtctcttggatggtgagaatcctg atcgggttcctcgtcctgtggattggaaccaatagcaggaatacaagcatggccatga gctgcatcgctattggcatcatcacactctacctgggtgtggtggttcaggca 266. >28C1 atgaaaactagcctctgcctgatgatgatgctgcctgctactatggccttctctctgt (C15/full caacaagagatggcgagcctctcatgatcgtggccaagcacgaaagagggaggcctct length gctgttcaagactactgaagggatcaataagtgcactctcatcgccatggacctgggc prM/full gagctgtgtgaggataccatgacctacaagtgtccactgctgaggcagaacgaacccg length E) aggatatcgattgctggtgcaattcaacttctacttgggtgacctatgggacctgtaa ccaggctggcgagcataggagggataagaggtccgtcgccctggaccctcacgttggt ctgggcctggaaaccagaacccagacatggatgagcgctgagggagcttggaggcagg tcgagaaggttgaaacttgggctctgcgccatccaggctttacagtcattgccctgtt cctggctcatgcaattgggacctctattactcagaagggcatcatcttcattctgctc atgctcgtgacaccatctatggccatgaggtgcgtgggcatcggcaaccgcgacttcg tggagggcctgagcggcgccacctgggtggacgtggtgctggagcacggcggctgcgt gaccaccatggccaagaacaagcccaccctggacttcgagctgatcaagaccaccgcc aaggaggtggccctgctgaggacctactgcatcgaggccagcatcagcaacatcacca ccgccaccaggtgccccacccagggcgagccctacctgaaggaggagcaggaccagca gtacatctgcaggagggacgtggtggacaggggctggggcaacggctgcggcctgttc ggcaagggcggcgtggtgacctgcgccaagttcagctgcagcggcaagatcaccggca acctggtgcagatcgagaacctggagtacaccgtggtggtgaccgtgcacaacggcga cacccacgccgtgggcaacgacaccagcaaccacggcgtgaccgccaccatcaccccc aggagccccagcgtggaggtgaagctgcccgactacggcgagctgaccctggactgcg agcccaggagcggcatcgacttcaacgagatgatcctgatgaagatgaagaagaagac ctggctggtgcacaagcagtggttcctggacctgcccctgccctggaccgccggcgcc gacaccagcgaggtgcactggaaccacaaggagaggatggtgaccttcaaggtgcccc acgccaagaggcaggacgtgaccgtgctgggcagccaggagggcgccatgcacagcgc agtggccggcgccaccgaggtggacagcggcgacggcaaccacatgttcgccggccac ctgaagtgcaaggtgaggatgtagaagctgcgcatcaagggcatgagctacaccatgt gcagcggcaagttcagcatcgacaaggagatggccgagacccagcacggcaccaccgt ggtgaaggtgaagtacgagggcaccggcgccccctgcaaggtgcccatcgagatcagg gacgtgaacaaggagaaggtggtgggcaggatcatcagcagcacccccctggccgaga acaccaacagcgtgaccaacatcgagctggagccccccttcggcgacagctacatcgt gatcggcatcggcgacaaggccctgaagatcaactggttcaggaagggcagcagcatc ggcaagatgttcgaggccaccgccaggggcgccaggaggatggccatcctgggcgaca ccgcctgggacttcggcagcgtgggcggcctgctgaccagcctgggcaaggccgtgca ccaggtgttcggcagcgtgtacaccacaatgtttggcggcgtctcttggatggtgaga atcctgatcgggttcctcgtcctgtggattggaaccaatagcaggaatacaagcatgg ccatgagctgcatcgctgttggcggcatcacactcttcctgggtttcaccgttcacgc a 267. >28D11 atgagatctaccatcaccctgctctgcctgatccctactgttatggccttctctctgt (C15/full caacaagagatggcgagcctctcatgatcgtgggcaagaacgaaagagggaagagcct length gctgttcaagactgcttcagggatcaatatgtgcactctcatcgccatggacctgggc prM/full gagatgtgtgaggataccgtgacctacaagtgtccactgctggtcaacaccgaacccg length E) aggatatcgattgctggtgcaatctgacttctacttgggtgatgtatgggacctgtac ccaggctggcgagcataggagggataagaggtccgtcgccctggctcctcacgttggt atgggcctggacaccagaacccagacatggatgagctctgagggagcttggaagcaga tccagaaggttgagacttgggctctgcgccatccaggctttacagtcattgccctgtt cctggctcatgcaattgggacctctattactcagaagggcatcatcttcattctgctc atgctcgtgacaccatctatggccatgaggtgcgtgggcatcggcaaccgcgacttcg tggagggcctgagcggcgccacctgggtggacgtggtgctggagcacggcagctgcgt gaccaccatggccaagaacaagcccaccctggacatcgagctgcagaagaccgaggcc acccagctggccaccctgaggaagctgtgcatcgagggcaagatcagcaacaccacca ccgacagcaggtgccccacccagggcgaggccaccctgcccgaggagcaggaccagaa ctacgtgtgcaagcacacctacgtggacaggggctggggcaacggctgcggcctgttc ggcaagggcagcctggtgacctgcgccaagttccagtgcctggagcccatcgagggca aggtggtgcagcacgagaacctgaagtacaccgtgatcatcaccgtgcacaccggcga ccagcaccaggtgggcaacgacacccagggcgtgaccgtggagatcaccccccaggcc agcaccgtggaggccatcctgcccgagtacggcaccctgggcctggagtgcagcccca ggaccggcctggacttcaacgagatgatcctgctgaccatgaagaacaaggcctggat ggtgcacaggcagtggttcttcgacctgcccctgccctggaccagcggcgccaccacc gagacccccacctggaacaggaaggagctgctggtgaccttcaagaacgcccacgcca agaagcaggaggtggtggtgctgggcagccaggagggcgccatgcacaccgccctgac cggcgccaccgagatccagaccagcggcaccaccaccatcttcgccggccacctgaag tgcaggctgaagatggacaagctgaccctgaagggcgtgagctacgtgatgtgcaccg gcagcttcaagctggagaaggaggtggccgagacccagcacggcaccgtgctggtgca ggtgaagtacgagggcaccgacgccccctgcaagatccccttcagcagccaggacgag aagggcgtgacccagaacggcaggctgatcaccgccaaccccgtggtgaccaagaagg aggagcccgtgaacatcgaggccgagccccccttcggcgagagcaacatcgtgatcgg catcggcgacagcgccctgaccctgcactggttcaggaagggcagcagcatcggcaag atgttcgagagcacctacaggggcgccaagaggatggccatcctgggcgagaccgcct gggacttcggcagcgtgggcggcctgctgaccagcctgggcaaggccgtgcaccaggt gttcggcagcgtgtacaccacaatgtttggcggcgtctcttggatggtgagaatcctg atcgggttcctcgtcctgtggattggaaccaatagcaggaatacaagcatggccatga gctgcatcgctgttggcgtcatcacactctacctgggtgccatggttcaggca 268. >28E12 atgaaaactagcctctgcctgatgatgatgctgcctgctactctggccttccatctga (C15full catcaagagatggcgagcctaggatgatcgtgggcaagaacgaaagagggaagagcct length gctgttcaagactactgaagggatcaataagtgcactctcatcgccatggacctgggc prM/full gagatgtgtgaggataccgtgacctacaagtgtccactgctggtcaacaccgaacccg length E) aggatatcgattgctggtgcaatctgacttctacttgggtgacctatgggacctgtac caccacaggcgagcataggagggagaagaggtccgtcgccctggaccctcacgttggt ctgggcctggaaaccagaaccgagacatggatgagctctgagggagcttggaggcagg tcgagaaggttgaaacttgggctctgcgccatccaggctttacaatcctggccctgtt cctggctcattacattgggacctctctgactcagaaggtcgtcatcttcattctgctc atgctcgtgacaccatctatgaccatgaggtgcgtgggcgcgggcaacagggacttcg tggagggcctgagcggcgccacctgggtggacgtggtgctggagcacggcagctgcgt gaccaccatggccaagaacaagcccaccctggacttcgagctgatcaagaccgaggcc acccagcccgccaccctgaggaagtactgcatcgaggccaagatcaccaacatcacca ccgacagcaggtgccccacccagggcgaggccatcctgaacgaggagcaggacaagag gttcgtgtgcaagcacagcatggtggacaggggctggggcaacggctgcggcctgttc ggcaagggcggcgtggtgacctgcgccatgttccagtgcctggagcccatcgagggca aggtggtgcagcacgagaacctgaagtacaccgtgatcatcaccgtgcacaccggcga ccagcaccaggtgggcaacgacacccagggcgtgaccgtggagatcaccccccaggcc agcaccgtggaggccatcctgcccgagtacggcaccctgggcctggagtgcagcccca ggaccggcctggacttcaacgagatgatcctgctgaccatgaagaacaaggcctggat ggtgcacaggcagtggttcttcgacctgcccctgccctggaccagcggcgccaccacc gagacccccacctggaacaggaaggagctgctggtgaccttcaagaacgcccacgcca agaagcaggaggtggtggtgcggggcagccaggagggcgccatgcacagcgccctggc cggcgccaccgaggtggacagcggcgacggcaaccacatgttcgccggccacctgaag tgcaggctgaggatggacaagctgcagctgaagggcatgagctacagcatgtgcaccg gcaagttccagatcgtgaaggagatcgccgagacccagcacggcaccatcgtgatcag ggtgcagtacgagggcgacggcagcccctgcaagatccccctggagatcatggacctg gagaagaggcacgtgctgggcaggctgatcaccgtgaaccccatcgtgaccgagaagg acagccccgtgaacatcgaggccgagccccccttcggcgacagctacatcatcatcgg cgtgggcgacagcgccctgaccctgcactggttcaggaagggcagcagcatcggcaag atgttcgagagcacctacaggggcgccaggaggatggccatcctcggcgatacagcct gggattttggctcaatcggaggggtgttcaccagcgtgggtaaactgattcatcagat tttcggtacagcatatggcgtgctgttttccggcgtctcttggaccatgaagatcggt attggcatcctcctcacatggctgggaatgaatagcaggagcacaagcctgagcgtta aggcctcgtcctggttggcgtcatcacactctacctgggtgccatggttcaggca 269. >28F9 atgaacaggaggaggagaactgtgggcgtgattatcatgctgatccctactgctatgg (C15/full ccttccatctgacaacaagaaatggcgagcctcggatgatcgtggccaagcacgaaag length agggaggcctctgctgttcaagactactgaagggatcaataagtgcactctcatcgcc prM/full atggacctgggcgagatgtgtgaggataccgtgacctacaagtgtccactgctggtca length E) acaccgaacccgaggatatcgattgctggtgcaatctgacttctacttgggtgaccta tgggacctgtagccagacaggcgagcataggagggagaagaggtccgtcgccctggac cctcacgttggtctgggcctggaaaccagaaccgagacatggatgaactctgagggag cttggaagcagatccagaaggttgagacttgggctctgcgccatccaggctttacagt cattgccctgttcctggctcatgcaattgggacctctattactcagaagggcatcatc ttcattctgctcatgctcgtggcaccatcttacggcatgaggtgcgtgggcgtgggca acagggacttcgtggagggcgtgagcggcggcgcctgggtggacctggtgctggagca cggcggctgcgtgaccaccatggccaagaacaagcccaccctggacttcgagctgatc aagaccgaggccacccagcccgccaccctgaggaagtactgcatcgaggccaagctga ccaataccaccaccgacagcaggtgccccacccagggcgagcccagcctgaaggagga gcaggaccagcagtacatctgcaggagggacgtggtggacaggggctggggcaacggc tgcggcctgttcggcaagggcagcctggtgacctgcgccaagttccagtgcctggagc ccatcgagggcaaggtggtgcagcacgagaacctgaagtacaccgtgatcatcaccgt gcacaccggcgaccagcaccaggtgggcaacgacaccagcaaccacggcgtgaccgcc accatcacccccaggagccccagcgtggaggtgaagctgcccgactacggcgagctga ccctggactgcgagcccaggagcggcatcgacttcaacgagatgatcctgatgaagat gaagaagaagacctggctggtgcacaagcagtggttcctggacctgcccctgccctgg accgccggcgccgacaccagcgaggtgcactggaaccacaaggagaggatggtgacct tcaaggtaccccacgccaagaggcaggacgtgaccgtgctgggcagccaggagggcgc catgcacagcgccctggccggcgccaccgaggtggacagcggcgacggcaaccacatg ttcgccggccacctgaagtgcaaggtgaggatggagaagctgcgcatcaagggcatga gctacaccatgtgcagcggcaagttcagcatcgacaaggagatggccgagacccagca cggcaccaccgtggtgaaggtgaagtacgagggcaccggcgccccctgcaaggtgccc atcgagatcagggacgtgaacaaggagaaggtggtgggcaggatcatcagcagcaccc ccctggccgagaacaccaacagcgtgaccaacatcgagctggagccccccttcggcga cagctacatcgtgatcggcgtgggcgacagcgccctgaccctgcactggttcaagaag ggcagcagcatcggcaagatgttcgaggccaccgccaggggcgccaggaggatggcca tcctgggcgagaccgcctgggacttcggcagcgtgggcggcctgctgaccagcctggg caaggccgtgcaccaggtgttcggcagcgtgtacaccacaatgtttggcggcgtctct tggatggtgagaatcctgatcgggttcctcgtcctgtggattggaaccaatagcagga atacaagcatggccatgagctgcatcgctgttggcggcatcacactcttcctgggttt caccgttcacgca 270. >28H3 atgagacctaccatcaccctgctctgcctgatgatgatgctgcctgctactctggcct (C15/full tccatctgacaacaagaaatggcgagcctcacatgatcgtgggcaggcaagagaaggg length caagagcctgctgttcaagactgaggacggcgtgaatatgtgcactctcatggccatc prM/full gacctgggcgagctgtgtgaggataccatcacctacaagtgtccactgctgaggcaga length E) acgaacccgaggatatcgattgctggtgcaattcaacttctacttgggtgacctatgg gacctgtaccaccacaggcgagcataggagggagaagaggtccgtcgccctggttcct cacgttggtatgggcctggaaaccagaaccgagacatggatgagctctgagggagctt ggaagcacgcccagagggttgaatcttggattctgcgcaatccacgctttgcactcct ggctggcttcatggcttatatgattgggcagactggtattcagaggaccgtcttcttc gttctgatgatgctcgtggcaccatcttacggcatgaggtgcgtgggcgtgggcaaca gggacttcgtggagggcctgagcggcgccacctgggtggacgtggtgctggagcacgg cagctgcgtgaccaccatggccaagaacaagcccaccctggacatcgagctgctgaag accgaggtgaccaaccccgccgtgctgaggaagtactgcatcgaggccaagctgacca ataccaccaccgagagcaggtgccccacccagggcgaggccatcctgcccgaggagca ggaccagaactacgtgtgcaagcacacctacgtggacaggggctggggcaacggctgc ggcctgttcggcaagggcagcctggtgacctgcgccaagttccagtgcctggagccca tcgagggcaaggtggtgcagcacgagaacctgaagtacaccgtgatcatcaccgtgca caccggcgaccagcaccaggtgggcaacgacacccagggcgtgaccgtggagatcacc ccccaggccagcaccgtggaggccatcctgcccgagtacggcaccctgggcctggagt gcagccccaggaccggcctggacttcaacagggtggtgctgctgaccatgaagaacaa ggcctggatggtgcacaggcagtggttcttcgacctgcccctgccctggaccagcggc gccaccaccgagacccccacctggaacaggaaggagctgctggtgaccttcaagaacg cccacgccaagaagcaggaggtggtggtgctgggcagccaggagggcgccatgcacac cgccctgaccggcgccaccgagatccagatgagcagcggcaacctgctgttcaccggc cacctgaagtgcaggctgaggatggacaagctgcagctgaagggcatgagctacacca tgtgcagcggcaagttccagatcgtgaaggagatcgccgagacccagcacggcaccat cgtgatcagggtgcagtacgagggcgacggcagcccctgcaagatccccctggagatc atggacctggagaagaggcacgtgctgggcaggctgatcaccgtgaaccccatcgtga tcgacaaggagaagcccgtgaacatcgaggccgagccccccttcggcgacagctacat catcatcggcgtggagcccggccagctgaagctgcactggttcaagaagggcagcagc atcggccagatgttcgagaccaccatgaggggcgccaagaggatggccatcctgggcg acaccgcctgggacttcggcagcctgggcggcgtgttcaccagcatcggcaaggccct gcaccaggttttcggtgcaatctatggcgcagccttttccggcgtctcttggaccatg aagatcctgatcggggtcatcatcacatggattggaatgaatagcaggagcacaagcc tgagcgttagcctcgtcctggttggcgtcatcacactctacctgggtgccatggttca ggca 271. >28H9 atgagatctaccatcaccctgctctgcctgatccctactgttatggccttctctctgt (C15/full caacaagagatggcgagcctctcatgatcgtggccaagcacgaaagagggaggcctct length gctgttcaagactactgaagggatcaataagtgcactctcatcgccatggacctgggc prM/full gagatgtgtgaggataccgtgacctacaagtgtccactgctggtcaacaccgaacccg length E) aggatatcgattgctggtgcaatctgacttctacttgggtgatgtatgggacctgtac ccagtccggagagagaaggagggagaagaggtccgtcgccctggttcctcacgttggt atgggcctggaaaccagaaccgagacatggatgagcgctgagggagcttggaggcagg tcgagaaggttgaaacttgggctctgcgccatccaggctttacaatcctggccctgtt cctggctcattacattgggacctctctgactcagaaggtcgtcatcttcattctgctc atgctcgtgacaccatctatgaccatgaggtgcgtgggcgtgggcaacagggacttcg tggagggcgtgagcggcggcagctgggtggacatcgtgctggagcacggctcctgcgt gaccaccatggccaagaacaagcccaccctggacttcgagctgatcaagaccgaggcc acccagcccgccaccctgaggaagtactgcatcgaggccaagctgaccaataccacca ccgagagcaggtgccccacccagggcgagcccagcctgaacgaggagcaggacaagag gttcgtgtgcaagcacagcatggtggacaggggctggggcaacggctgcggcctgttc ggcaagggcagcctggtgacctgcgccaagttccagtgcctggagcccatcgagggca aggtggtgcagcacgagaacctgaagtacaccgtgatcatcaccgtgcacaccggcga ccagcaccaggtgggcaacgacacccagggcgtgaccgtggagatcaccccccaggcc agcaccgtggaggccatcctgcccgagtacggcaccctgggcctggagtgcagcccca ggaccggcctggacttcaacgagatgatcctgctgaccatgaagaacaaggcctggat ggtgcacaggcagtggttcttcgacctgcccctgccctggaccgccggcgccgacacc agcgaggtgcactggaaccacaaggagaggatggtgaccttcaaggtgccccacgcca agaggcaggacgtgaccgtgctgggcagccaggagggcgccatgcacaccgccctgac cggcaccaccgagatccagaacagcggcggcaccagcatcttcgccggccacctgaag tgcaggctgaagatggacaagctggagctgaagggcatgagctacgccatgtgcctga acaccttcgtgctgaagaaggaggtgagcgagacccagcacggcaccatcctgatcaa ggtggagtacaagggcgaggacgccccctgcaagatccccttcagcaccgaggacggc cagggcaaggcccacaacggcaggctgatcaccgccaaccccgtggtgaccaagaagg aggagcccgtgaacatcgaggccgagccccccttcggcgagagcaacatcgtgatcgg catcggcgacaaggccctgaagatcaactggtacaagaagggcagcagcatcggcaag atgttcgagagcacctacaggggcgccaagaggatggccatcctgggcgagaccgcct gggacttcggcagcgtgggcggcgtgctgaacagcctgggcaagatggtgcaccagat cttcggttcagcatataccgcactgttttccggcgtctcttggaccatgaagatcctg atcggggtcatcatcacatggattggaatgaatagcaggagcacaagcctgagcgtta gcctcgtcctggttggcgtcatcacactctacctgggtgccatggttcaggca 272. 6C6 PRM15 atgaccgtgttcttcatcctgatgatgctggtgaccccctctatggcc DNA 273. 6F4 PRM15 atggccgtgttcttcatcctgctgatgctggtgaccccctccatgaca DNA 274. 7A9 PRM15 atgaccgtgttcttcatcctgatgatgctggtggccccctcctacgcc DNA 275. 11E2 Atgaccgtgttcttcatcctgctgatgctggtggccccctcctacggc PRM15 DNA 276. 12E3 atggccgtgttcttcatcctgatgatgctggtggccccctcctacggc PRM15 DNA 277. 15C2 atggtggtgatcttcatcctgctgatgctggtgaccccctctatggcc PRM15 DNA 278. 17A12 atgaccgtgttcttcatcctgatgatgctggtgaccccctccatgaca PRM15 DNA 279. 17D5 atgaccgtgttcttcatcctgctgatgctggtgaccccctccatgaca PRM15 DNA 280. 17G5 atgaccgtgttcttcatcctgatgatgctggtggccccctctatggcc PRM15 DNA 281. DEN-1 atgggcatcatcttcatcctgctgatgctggtgaccccctctatggcc PRM15 of DEN-1 DEN15/tE CO parent 282. DEN-2 atgggcctgatcctgatcctgcagaccgccgtggccccctccatgaca PRM15 of DEN-2 PPM15/tE CO parent 283. DEN-3 atggtggtgatcttcatcctgctgatgctggtgaccccctccatgaca PRM15 of DEN-3 PRM15/LE CO parent 284. DEN-4 atgaccgtgttcttcatcctgatgatgctggtggccccctcctacggc PRM15 of DEN-4 PRM15/tE CO parent 285. 2/7-NPRM atgaggtgcatcggcatcagcaacagggacttcgtggagggcgtgagcggcgccacct Round I gggtggacgtggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcc shuffled caccctggacatcgagctgatcaagaccgaggccacccagcccgccaccctgaggaag clone no tactgcatcgaggccaagctgaccaataccaccaccgagagcaggtgccccacccagg PRM gcgagccctacctgaaggaggagcaggaccagaactacgtgtgcaagcacacctacgt ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgc gccaagttcaagtgcgtgaccaagctggagggcaagatcgtgcagtacgagaacctga agtacagcgtgatcgtgaccgtgcacaccggcgaccagcaccaggtgggcaacgagac caccgagcacggcaccatcgccaccatcaccccccaggcccccacdagcgagatccag ctgaccgactacggcgccctgaccctggactgcagccccaggaccggcctggacttca acagggtggtgctgctgaccatgaagaagaagacctggctggtgcacaagcagtggtt cctggacctgcccctgccctggaccgccggcgccagcaccagccaggagacctggaac aggaaggagctgctggtgaccttcaagaacgcccacgccaagaagcaggaggtggtgg tgctgggcagccaggagggcgccatgcacaccgccctgaccggcgccaccgaggtgga cagcggcgacggcaacctgctgttcaccggccacctgaagtgcaggctgaagatggac aagctgcagctgaagggcatgagctacagcatgtgcaccggcaagttccagatcgtga aggagatcgccgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcga cggcgccccctgcaagatccccttcagcaccgaggacggccagggcaaggcccacaac ggcaggctgatcaccgccaaccccatcgtgatcgacaaggagaagcccgtgaacatcg agctggagccccccttcggcgacagctacatcgtggtgggcgccggcgacaaggccct gaagatcaactggtacaagaagggcagcagcatcggcaagatgttcgagagcacctac aggggcgccaagaggatggccatcctgggcgagaccgcctgggacttcggcagcgtgg gcggcgtgttcaccagcatcggcaaggccctgcaccaggtgttcggcgccatctacgg cgcc 286. 2G11-NPRM atgaggtgcgtgggcatcggcaaccgcgacttcgtggagggcctgagcggcgccacct Round I gggtggacgtggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcc shuffled caccctggacttcgagctgatcaagaccaccgccaaggaggtggccctgctgaggacc clone no tactgcatcgaggccagcatcagcaacatcaccaccgccaccaggtgccccacccagg PRM gcgagccctacctgaaggaggagcaggaccagcagtacatctgcaggagggacgtggt ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgc gccaagttcaagtgcgtgaccaagctggagggcaacatcgtgcagcccgagaacctgg agtacaccatcgtgatcaccccccacaccggcgaccagcaccaggtgggcaacgacac ccagggcgtgaccgtggagatcaccccccaggccagcaccgtggaggccatcctgccc gagtacggcaccctgggcctggagtgcagccccaggactggcctggacttcaacgaga tgatcctgctgaccatgaagaacaaggcctggatggtgcacaggcagtggttcttcga cctgcccctgccctggaccagcggcgccaccaccgagacccccacctggaacaggaag gagctgctggtgaccttcaagaacgcccacgccaagaagcaggaggtggtggtgctgg gcagccaggagggcgccatgcacagcgccctggccggcgccaccgagatccagatgag cagcggcaacctgctgttcaccggccacctgaagtgcaggctgaagatggacaagctg cagctgaagggcatgagctacaccatgtgcaccggcaagttccagatcgtgaaggaga tcgccgagacccagcacggcaccatcctgatcaaggtggagtacaagggcgaggacgc agcctgcaagatccccttcagcaccgaggacgagaagggcgtgacccagaacggcagg ctgatcaccgccaaccccatcgtgaccgagaaggacagccccgtgaacatcgaggccg agccccccttcggcgagagctacatcgtggtgggcgccggcgagaaggccctgaagct gagetggttcaagaagggcagcagcatcggccagatgttcgagagcacctacaggggc gccaagaggatggccatcctgggcgagaccgcctgggacttcggcagcgtgggcggcc tgctgaccagcctgggcaaggccgtgcaccaggtgttcggcagcgtgtacaccaccat gttcggc 287. 5/21-NPRM atgaggtgcgtgggcgtgggcaacagggacttcgtggagggcctgagcggcgccacct Round I gggtggacgtggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcc shuffled caccctggacatcgagctgctgaagaccgaggtgaccaaccccgccaccctgaggaag clone tactgcatcgaggccaagatcaccaacatcaccaccgacagcaggtgccccacccagg no PPM gcgaggccatcctgcccgaggagcaggaccagaactacgtgtgcaagcacacctacgt ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcctggtgacctgc gccaagtteacetgcaagaagaacatggagggcaacatcgtgcagcccgagaacctgg agtacaccatcgtgatcaccccccacagcggcgaggagcacgccgtgggcaacgacac tggcaagcacggcaaggagatcaagatcaccccccagagcagcatcaccgaggccgag ctgaccggctacggcaccgtgaccatggagtgcagccccaggaccggcctggacttca acagggtggtgctgctgaccatgaagaagaagagctggctggtgcacaagcagtggtt cctggacctgcccctgccctggaccgccggcgccgacaccagcgaggtgcactggaac cacaaggagaggatggtgaccttcaaggtgccccacgccaagaggcaggacgtgaccg tgctgggcagccaggagggcgccatgcacagcgccctggccggcgccaccgagatcca gatgagcagcggcaacctgctgttcaccggccacctgaagtgcaggctgaggatggac aagctgcagctgaagggcatgagctacgtgatgtgcaccggcaagttccagatcgtga aggagatcgccgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcga cggcagcccctgcaagatccccctggagatcatggacctggagaagaggcacgtgctg ggcaggctgatcaccgtgaaccccatcgtgatcgacaaggagaagcccgtgaacatcg aggccgagccccccttcggcgagagctacatcgtggtcggcgtgggcgacagcgccct gaccctgcactggttcaggaagggcagcagcatcggccagatgttcgagaccaccatg aggggcgccaagaggatggccatcttgggcgacaccgcctgggacttcggcagcctgg gcggcgtgttcaccagcatcggcaaggccctgcaccaggtgttcggcgccatctacgg cgcc 288. 6C-NPRM atgaggtgcgtgggcatcggcaaccgcgacttcgtggagggcgtgagcggcggcgcct Round I gggtggacctggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcc shuffled caccctggacttcgagctgatcaagaccaccgccaaggaggtggccctgctgaggacc clone no tactgcatcgaggccagcatcagcaacatcaccaccgccaccaggtgccccacccagg PPM gcgagccctacctgaaggaggagcaggaccagcagtacatctgcaggagggacgtggt ggacaggggctggggcaacggctgcggcctgttcggcaagggcggcatcgtgacctgc gccatgttcacctgcaagaagaacatggagggcaacatcgtgcagcccgagaacctgg agtacaccatcgtgatcaccccccacagcggcgaggagcacgccgtgggcaacgacac tggcaagcacggcaaggagatcaagatcaccccccagagcagcatcaccgaggccgag ctgaccggctacggcaccgtgaccatggagtgcagccccaggaccggcctggacttca acgagatgatcctgatgaagatgaagaacaaggcctggatggtgcacaggcagtggtt cctggacctgcccctgccctggaccgccggcgccgacaccgagacccccacctggaac aggaaggagctgctggtgaccttcaagaacgcccacgccaagaagcaggaggtggtgg tgctgggcagccaggagggcgccatgcacagcgccctggccggcgccaccgaggtgga cagcggcgacggcaaccacatgttcgccggccacctgaagtgcaaggtgaggatggac aagctgcagctgaagggcatgagctacagcatgtgcaccggcaagttccagatcgtga aggagatcgccgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcga cggcagcccctgcaagatccccctggagatcatggacctggagaagaggcacgtgctg ggcaggctgatcactgtgaaccccatcgtgaccgagaaggacagccccgtgaacatcg aggccgagccccccttcggcgacagctacatcatcatcggcgtggagcccggccagct gaagctgcactggttcaagaagggcagcagcatcggcaagatgttcgaggccaccgcc aggggcgccaggaggatggccatcctgggcgacaccgcctgggacttcggcagcgcct acacc 289. 6E12-NPRM atgaggtgcgtgggcgtgggcaacagggacttcgtggagggcctgagcggcgccacct Round I gggtggacgtggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcc shuffled caccctggacatcgagctgctgaagaccgaggtgaccaaccccgccgtgctgaggaag clone no ctgtgcatcgaggccagcatcagcaacatcaccaccgccaccaggtgccccacccagg PRM gcgaggccatcctgcccgaggagcaggaccagaactacgtgtgcaagcacacctacgt ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcgtggtgacctgc gccaagttccagtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctga agtacaccgtgatcatcaccgtgcacaccggcgaccagcaccaggtgggcaacgacac ccagggcgtgaccgtggagatcaccccccaggccagcaccgtggaggccatcctgccc gagtacggcaccctgggcctggagtgcagccccaggaccggcctggacttcaacgaga tgatcctgctgaccatgaagaacaaggcctggatggtgcacaggcagtggttcttcga cctgcccctgccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaag gagaggatggtgaccttcaagaccgcccacgccaagaagcaggacgtggtggtgctgg gcagccaggagggcgccatgcacaccgccctgaccggcgccaccgagatccagatgag cagcggcaccaccaccatcttcgccggccacctgaagtgcaaggtgaggatggacaag ctgaccctgaagggcgtgagctacgtgatgtgcaccggcagcttcaagctggagaagg aggtggccgagacccagcacggcaccgtgctggtgcaggtgaagtacgagggcaccga cgccccctgcaagatccccttcagcagccaggacgagaagggcgtgacccagaacggc aggctgatcaccgccaaccccatcgtgatcgacaaggagaagcccgtgaacatcgagg ccgagcccccctttggcgacagctacatcatcatcggcgtggagcccggccagctgaa gctgcactggttcaagaagggcagcagcatcggccagatgttcgagaccaccatgagg ggcgccaagaggatggccatcctgggcgacaccgcctgggacttcggcagcgtgggcg gcctgctgaccagcctgggcaagatggtgcaccagatcttcggcagcgtgtacaccac catgttcggc 290. 6F4-NPRM atgaggtgcgtgggcatcggcaaccgcgacttcgtggagggcgtgagcggcggcgcct Round I gggtggacctggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcc shuffled caccctggacttcgagctgatcaagaccgaggccacccagcccgccaccctgaggaag clone no tactgcatcgaggccaagctgaccaataccaccaccgagagcaggtgccccacccagg PRM gcgaggccatcctgcccgaggagcaggaccagcagtacatctgcaggagggacgtggt ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgc gccaagttccagtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctga agtacaccgtgatcatcaccgtgcacaccggcgaccagcaccaggtgggcaacgacac ccagggcgtgaccgtggagatcaccccccaggccagcaccgtggaggccatcctgccc gagtacggcaccctgggcctggagtgcagccccaggaccggcctggacttcaacgaga tgatcctgctgaccatgaagaagaagagctggctggtgcacaagcagtggttcctgga cctgcccctgccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaag gagaggatggtgaccttcaaggtgccccacgccaagaggcaggacgtgaccgtgctgg gcagccaggagggcgccatgcacagcgccctggccggcgccaccgaggtggacagcgg cgacggcaaccacatgttcgccggccacctgaagtgcaaggtgaggatggagaagctg cgcatcaagggcatgagctacagcatgtgcaccggcaagttccagatcgtgaaggaga tcgccgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcgacggcag cccctgcaagatccccctggagatcatggacctggagaagaggcacgtgctgggcagg ctgatcaccgtgaaccccatcgtgaccgagaaggacagccccgtgaacatcgaggccg agccccccttcggcgacagctacatcatcatcggcgtggagcccggccagctgaagct gcactggttcaagaagggcagcagcatcggccagatgttcgaggccaccgccaggggc gccaggaggatggccatcctgggcgacaccgcctgggacttcggcagcgtgggcggcg agtgttcaccagcatcggcaaggccctgcaccaggtgttcggcgccatctacggcgcc 291. 7A9-NPRM atgaggtgcgtgggcatcggcaaccgcgacttcgtggagggcgtgagcggcgccacct Round I gggtggacgtggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcc shuffled caccctggacatcgagctgcagaagaccgaggccacccagctggccaccctgaggaag clone no ctgtgcatcgaggccagcatcagcaacatcaccaccgccaccaggtgccccacccagg PRM gcgaggccaccctggtggaggagcaggacaccaacttcgtgtgcaggaggaccttcgt ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcctggtgacctgc gccaagttccagtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctga agtacaccatcgtgatcaccccccacagcggcgaggagcacgccgtgggcaacgacac tggcaagcacggcaaggagatcaagatcaccccccagagcagcatcaccgaggccgag ctgaccggctacggcaccgtgaccatggagtgcagccccaggaccggcctggacttea acgagatgatcctgctgaccatgaagagcaagacctggctggtgcacaagcagtggtt cctggacctgcccctgccctggaccgccggcgccgacaccagcgaggtgcactggaac cacaaggagaggatggtgaccttcaagaacgcccacgccaagaagcaggaggtggtgg tgctgggcagccaggagggcgccatgcacaccgccctgaccggcaccaccgagatcca gaacagcggcaacctgctgttcaccggccacctgaagtgcaggctgaggatggacaag ctgcagctgaagggcatgagctacagcatgtgcaccggcaagttccagatcgtgaagg agatcgccgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcgacgg cagecectgcaagatccccctggagatcatggacctggagaagaggcacgtgctgggc aggctgatcaccgtgaaccccatcgtgaccgagaaggacagccccgtgaacatcgagg ccgagccccccttcggcgacagctacatcatcatcggcgtggagcccggccagctgaa gctgcactggttcaggaagggcagcagcatcggcaagatgttcgagagcacctacagg ggcgccaagaggatggccatcctgggcgagaccgcctgggacttcggcagcgtgggcg gcctgctgaccagcctgggcaaggccgtgcaccaggtgttcggcagcgtgtacaccac catgttcggc 292. 11E2-NPRM atgaggtgcgtgggcgtgggcaacagggacttcgtggagggcctgagcggcgccacct Round II gggtggacgtggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcc shuffled caccctggacatcgagctgctgaagaccgaggtgaccaaccccgccgtgctgaggaag clone no ctgtgcatcgaggccagcatcagcaacatcaccaccgccaccaggtgccccacccagg PRM gcgaggccatcctgcccgaggagcaggaccagaactacgtgtgcaagcacacctacgt ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcctggtgacctgc gccaagttcaagtgcgtgaccaagctggagggcaagatcgtgcagtacgagaacctga agtacagcgtgatcgtgaccgtgcacaccggcgaccagcaccaggtgggcaacgagac caccgagcacggcaccatcgccaccatcaccccccaggcccccaccagcgagatccag ctgaccgactacggcgccctgaccctggactgcagccccaggaccggcctggacttca acagggtggtgctactgaccatgaagaagaagacctggctggtgcacaagcagtggtt cctggacctgcccctgccctggaccgccggcgccagcaccagccaggagacctggaac aggaaggagctgctggtgaccttcaagaacgcccacgccaagaggcaggacgtgaccg tgctgggcagccaggagggcgccatgcacagcgccctgaccggcgccaccgagatcca gatgagcagcggcaacctgctgttcgccggccacctgaagtgcaggctgaagatggac aagctgcagctgaagggcatgagctacagcatgtgcaccggcaagttccagatcgtga aggagatcgccgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcga cggcgccccctgcaagatccccctggagatcatggacctggagaagaggcacgtgctg ggcaggctgatcaccgtgaaccccatcgtgaccgagaaggacagccccgtgaacatcg aggccgagccccccttcggcgagagctacatcgtggtcggcgtgggcgacagcgccct gaccctgcactggttcaggaagggcagcagcatcggccagatgttcgaggccaccgcc aggggcgccaagaggatggccatcctgggcgacaccgcctgggacttcggcagcgtgg gcggcgtgttcaccagcatcggcaaggccctgcaccaggtgttcggcgccatctacgg cgcc 293. 12E3-NPRM atgaggtgcgtgggcgtgggcaacagggacttcgtggagggcctgagcggcgccacct Round II gggtggacgtggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcc shuffled caccctggacatcgagctgatcaagaccaccgccaaggaggtggccctgctgaggacc clone no tactgcatcgaggccagcatcagcaacatcaccaccgccaccaggtgccccacccagg PRM gcgaggccatcctgcccgaggagcaggaccagaactacgtgtgcaagcacacctacgt ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcctggtgacctgc gccaagttcacctgcaagaagaacatggagggcaacatcgtgcagcccgagaacctgg agtacaccatcgtgatcaccccccacagcggcgaggagcacgccgtgggcaacgacac tggcaagcacggcaaggagatcaagatcaccccccagagcagcatcaccgaggccgag ctgaccggctacggcaccgtgaccatggagtgcagccccaggaccggcctggacttca acagggtggtgctgctgaccatgaagaagaagagctggctggtgcacaagcagtggtt cctggacctgcccctgccctggaccgccggcgccagcaccagccaggagacctggaac aggaaggagctgctggtgaccttcaagaacgcccacgccaagaagcaggaggtggtgg tgctgggcagccaggagggcgccatgcacaccgccctgaccggcgccaccgagatcca gatgagcagcggcaacctgctgttcaccggccacctgaagtgcaggctgaggatggac aagctgcagctgaagggcatgagctacagcatgtgcaccggcaagttccagatcgtga aggagatcgccgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcga cggcgccccctgcaagatccccttcagcaccgaggacggccagggcaaggcccacaac ggcaggctgatcaccgccaaccccatcgtgatcgacaaggagaagcccgtgaacatcg agctggagccccccttcggcgagagctacatcgtggtgggcgccggcgacaaggccct gaagatcaactggtacaagaagggcagcagcatcggcaagatgttcgagagcacctac aggggcgccaagaggatggccatcctgggcgagaccgcctgggacttcggcagcgtgg gcggcctgctgaccagcctgggcaaggccgtgcaccaggtgttcggcagcgtgtacac caccatgttcggc 294. 13E2-NPRM atgaggtgcgtgggcatcggcaaccgcgacttcgtggagggcgtgagcggcgccacct Round II gggtggacgtggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcc shuffled caccctggacttcgagctgatcaagaccaccgccaaggaggtggccctgctgaggacc clone no tactgcatcgaggccagcatcagcaacatcaccaccgccaccaggtgccccacccagg PPM gcgagccctacctgaaggaggagcaggaccagcagtacatctgcaggagggacgtggt ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgc gccaagttcaagtgcgtgaccaagctggagggcaacatcgtgcagcccgagaacctgg agtacaccatcgtgatcaccccccacagcggcgaggagcacgccgtgggcaacgacac tggcaagcacggcaaggagatcaagatcaccccccagagcagcatcaccgaggccgag ctgaccggctacggcaccgtgaccatggagtgcagccccaggaccggcctggacttca acgagatgatcctgctgaccatgaagaagaagagctggctggtgcacaagcagtggtt cctggacctgcccctgccctggaccgccggcgccagcaccagccaggagacctggaac aggaaggagctgctggtgaccttcaagaacgcccacgccaagaagcaggaggtggtgg tgctgggcagccaggagggcgccatgcacaccgccctgaccggcgccaccgagatcca gatgagcagcggcaacctgctgttcaccggccacctgaagtgcaggctgaggatggac aagctgcagctgaagggcatgagctacaccatgtgcaccggcaagttccagatcgtga aggagatcgccgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcac cgacgccccctgcaagatccccttcagcaccgaggacggccagggcaaggcccacaac ggcaggctgatcaccgccaaccccatcgtgatcgacaaggagaagcccgtgaacatcg agctggagccccccttcggcgacagctacatcatcatcggcgtggagcccggccagct gaagctgcactggttcaagaagggcagcagcatcggcaagatgttcgaggccaccgcc aggggcgccaagaggatggccatcctgggcgagaccgcctgggacttcggcagcgtgg gcggcctgctgaccagcctgggcaagatggtgcaccagatcttcggcagcgtgtacac caccatgttcggc 295. 14E9-NPRM atgaggtgcgtgggcatcggcaaccgcgacttcgtggagggcgtgagcggcggcgcct Round II gggtggacctggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcc shuffled caccctggacatcgagctgatcaagaccgaggccacccagcccgccaccctgaggaag clone no tactgcatcgaggccaagatcaccaacatcaccaccgacagcaggtgccccacccagg PPM gcgaggccatcctgcccgaggagcaggaccagcagtacatctgcaggagggacgtggt ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcgtggtgacctgc gccaagttccagtgcgtgaccaagctggagggcaacatcgtgcagcccgagaacctgg agtacaccatcgtgatcaccccccacaccggcgaccagcaccaggtgggcaacgacac ccagggcgtgaccgtggagatcaccccccaggccagcaccgtggaggccatcctgccc gagtacggcaccctgggcctggagtgcagccccaggaccggcctggacttcaacgaga tgatcctgatgaagatgaagaacaaggcctggatggtgcacaagcagtggttcctgga cctgcccctgccctggaccagcggcgccagcaccagccaggagacctggaacaggaag gagctgctggtgaccttcaagaacgcccacgccaagaagcaggaggtggtggtgctgg gcagccaggagggcgccatgcacagcgccctggccggcgccaccgagatccagatgag cagcggcaacctgctgttcaccggccacctgaagtgcaggctgaagatggacaagctg cagctgaagggcatgagctacaccatgtgcaccggcaagttccagatcgtgaaggaga tcgccgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcgacggcag cccctgcaagatccccctggagatcatggacctggagaagaggcacgtgctgggcagg ctgatcaccgccaaccccatcgtgatcgacaaggagaagcccgtgaacatcgagctgg agccccccttcggcgacagctacatcgtggtgggcgccggcgacaaggccctgaagat caactggtacaagaagggcagcagcatcggcaagatgttcgagagcacctacaggggc ggccaagaggatggccatcctgggcgagaccgcctgggacttcggcagcgcctacacc 296. 13E11- atgaggtgcgtgggcgtgggcaacagggacttcgtggagggcgtgagcggcggcgcct NPRM gggtggacgtggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcc Round II caccctggacatcgagctgatcaagaccgaggccacccagcccgccaccctgaggaag shuffled tactgcatcgaggccagcatcagcaacatcaccaccgacaccaggtgccccacccagg clone no gtgagccctacctgaaggaggagcaggaccagaactacgtgtgcaagcacacctacgt PRM ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgc gccaagttcaagtgcgtgaccaagctggagggcaagatcgtgcagtacgagaacctga agtacagcgtgatcgtgaccgtgcacaccggcgaccagcaccaggtgggcaacgagac caccgagcacggcaccatcgccaccatcaccccccaggcccccaccagcgagatccag ctgaccgactacggcaccgtgaccatggagtgcagccccaggaccggcctggacttca acgagatgatcctgctgaccatgaagagcaagacctggctagtgcacaagcagtggtt cctggacctgcccctgccctggaccgccggcgccgacaccgagacccccacctggaac aggaaggagctgctggtgaccttcaagaacgcccacgccaagaagcaggaggtggtgg tgctgggcagccaggagggcgccatgcacaccgccctggccggcgccaccgagatcca gatgagcagcggcaacctgctgttcaccggccacctgaagtgcaggctgaagatggac aagctgcagctgaagggcatgagctacaccatgtgcaccggcaagttccagatcgtga aggagatcgccgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcac cgacgccccctgcaagatccccttcagcaccgaggacggccagggcaaggcccacaac ggcaggctgatcaccgccaaccccatcgtgatcgacaaggagaagcccgtgaacatcg agctggagccccccttcggcgagagctacatcgtggtgggcgccggcgagaaggccct gaccctgcactggttcaggaagggcagcagcatcggccagatgttcgagaccaccatg aggggcgccaagaggatggccatcttgggcgagaccgcctgggacttcggcagcgtgg gcggcgtgttcaccagcatcggcaaggccctgcaccaggtgttcggcgccatctacgg cgcc 297. 16E8-NPRM atgaggtgcgtgggcgtgggcaacagggacttcgtggagggcctgagcggcgccacct Round II gggtggacgtggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcc shuffled caccctggacttcgagctgatcaagaccaccgccaaggaggtggccctgctgaggacc clone no tactgcatcgaggccagcatcagcaacatcaccaccgccaccaggtgccccacccagg PRM gcgagccctacctgaaggaggagcaggaccagcagtacatctgcaggagggacgtggt ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgc gccaagttcaagtgcgtgaccaagctggagggcaacatcgtgcagcccgagaacctgg agtacaccatcgtgatcaccccccacaccggcgaccagcaccaggtgggcaacgacac ccagggcgtgaccgtggagatcaccccccaggccagcaccgtggaggccatcctgccc gagtacggcaccctgggcctggagtgcagccccaggaccggcctggacttcaacgaga tgatcctgctgaccatgaagaacaaggcctggatggtgcacaagcagtggttcctgga cctgcccctgccctggaccagcggcgccaccaccgagacccccacctggaacaggaag gagctgctggtgaccttcaagaccgcccacgccaagaagcaggacgtggtggtgctgg gcagccaggagggcgccatgcacaccgccctgaccggcgccaccgagatccagatgag cagcggcaacctgctgttcaccggccacctgaagtgcaggctgaggatggacaagctg cagctgaagggcatgagctacgtgatgtgcaccggcaagttccagatcgtgaaggaga tcgccgagacccagcacggcaccatcctgatcaaggtggagtacaagggcgaggacgc cccctgcaagatccccttcagcaccgaggacggccagggcaaggcccacaacggcagg ctgatcaccgccaaccccatcgtgatcgacaaggagaagcccgtgaacatcgaggccg agccccccttcggcgagagcaacatcgtgatcggcatcggcgacaaggccctgaagat caactggtacaagaagggcagcagcatcggcaagatgttcgagaccaccatgaggggc gccaagaggatggccatcttgggcgacaccgcctgggacttcggcagcgtgggcggcg tgttcaccagcatcggcaagccctgcaccaggtgttcggcgccatctacggcgcc 298. 16E10- atgaggtgcatcggcatcagcaacagggacttcgtggagggcgtgagcggcgccacct NPRM gggtggacgtggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcc Round II caccctggacatcgagctgctgaagaccgaggtgaccaaccccgccaccctgaggaag shuffled tactgcatcgaggccaaaatcagcaacaccaccaccgacagcaggtgcodcacccagg clone no gcgaggccaccctggtggaggagcaggaccagaactacgtgtgcaagcacacctacgt PRM ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgc gccaagttcaagtgcgtgaccaagctggagggcaagatcgtgcagtacgagaacctga agtacagcgtgatcatcaccgtgcacaccggcgaccagcaccaggtgggcaacgacac ccagggcgtgaccgtggagatcaccccccaggccagcaccgtggaggccatcctgccc gagtacggcgccctgggcctggagtgcagccccaggaccggcctggacttcaacgaga tgatcctgctgaccatgaagaacaaggcctggatggtgcacgggcagtggttcttcga cctgcccctgccctggaccagcggcgccaccaccgagacccccacctggaacaggaag gagctgctggtgaccttcaagaacgcccacgccaagaagcaggaggtggtggtgctgg gcagccaggagggcgccatgcacagcgccctggccggcgccaccgagatccagatgag cagcggcaacctgctgttcaccggccacctgaagtgcaggctgaggatggacaagctg cagctgaagggcatgagctacagcatgtgcaccggcaagttccagatcgtgaaggaga tcgccgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcgacggcag agcctgcaagatccccctggagatcatggacctggagaagaggcacgtgctgggcagg ctgatcaccgtgaaccccatcgtgatcgacaaggagaagcccgtgaacatcgaggccg agccccccttcggcgagagctacatcgtggtcggcgtgggcgacagcgccctgaccct gcactggttcaggaagggcagcagcatcggccagatgttcgagaccaccatgaggggc gccaagaggatggccatcttgggcgacaccgcctgggacttcggcagcgtgggcggcg tgctgaacagcctgggcaagatggtgcaccagatcttcggcagcgtgtacaccaccat gttcggc 299. 18E9-NPRM atgaggtgcgtgggcgtgggcaacagggacttcgtggagggcctgagcggcgccacct Round II gggtggacgtggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcc shuffled caccctggacttcgagctgatcaagaccgaggccacccagcccgccaccctgaggaag clone no tactgcatcgaggccaagctgaccaataccaccaccgccaccaggtgccccacccagg PRM gcgagccctacctgaaggaggagcaggaccagaactacgtgtgcaagcacacctacgt ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcgtggtgacctgc gccaagttccagtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctga agtacagcgtgatcgtgaccgtgcacaccggcgaccagcaccaggtgggcaacgacac ccagggcgtgaccgtggagatcaccccccaggccagcaccgtggaggccatcctgccc gagtacggcaccctgggcctggagtgcagccccaggaccggcctggacttcaacgaga tgatcctgctgaccatgaagaagaagagctggctggtgcacaagcagtggttcctgga cctgcccctgccctggaccagcggcgccagcaccagccaggagacctggaacaggaag gagctgctggtgaccttcaagaacgcccacgccaagaggcaggacgtgaccgtgctgg gcagccaggagggcgccatgcacaccgccctgaccggcgccaccgaggtggacagcgg cgacggcaacctgctgttcaccggccacctgaagtgcaggctgaagatggacaagctg cagctgaagggcatgagctacaccatgtgcaccggcaagttccagatcgtgaaggaga tcgccgagacccagcacggcaccatcctgatcaaggtggagtacaagggcgaggacgc cccctgcaagatccccttcagcagccaggacgagaagggcgtgacccagaacggcagg ctgatcaccgccaaccccatcgtgatcgacaaggagaagcccgtgaacatcgagctgg agccccccttcggcgacagctacatcgtggtgggcgccggcgacaaggccctgaagat caactggtacaagaagggcagcagcatcggcaagatgttcgagagcacctacaggggc gccaagaggatggccatcctgggcgacaccgcctgggacttcggcagcgtgggcggcg tgctgaacagcctgggcaagatggtgcaccagatcttcggcagcgtgtacaccaccat gttcggc 300. 18E10- atgaggtgcgtgggcgtgggcaacagggacttcgtggagggcctgagcggcgccacct NPRM gggtggacgtggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcc Round II caccctggacatcgagctgcagaagaccgaggccacccagctggccaccctgaggaag shuffled tactgcatcgaggccaagctgaccaataccaccaccgagagcaggtgccccacccagg clone no gcgaggccatcctgcccgaggagcaggaccagaactacgtgtgcaagcacacctacgt PRM ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgc gccaagttcaagtgcgtgaccaagctggagggcaagatcgtgcagtacgagaacctga agtacagcgtgatcgtgaccgtgcacaccggcgaccagcaccaggtgggcaacgagac caccgagcacggcaccatcgccaccatcaccccccaggcccccaccagcgagatccag ctgaccgactacggcgccctgaccctggactgcagccccaggaccggcctggacttca acagggtggtgctgctgaccatgaagaagaagagctggctggtgcacaagcagtggtt cctggacctgcccctgccctggaccgccggcgccgacaccagcgaggtgcactggaac cacaaggagaggatggtgaccttcaagaccgcccacgccaagaagcaggaggtggtgg tgctgggcagccaggagggcgccatgcacaccgccctgaccggcgccaccgaggtgga cagcggcgacggcaacctgctgttcaccggccacctgaagtgcaggctgaggatggac aagctgcagctgaagggcatgagctacagcatgtgcaccggcaagttccagatcgtga aggagatcgccgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcga cggcagcccctgcaagatccccctggagatcatggacctggagaagaggcacgtgctg ggcaggctgatcaccgccaaccccatcgtgatcgacaaggagaagcccgtgaacatcg agctggagccccccttcggcgacagctacatcgtggtgggcgccggcgacaaggccct gaagatcaactggtacaagaagggcagcagcatcggccagatgttcgagaccaccatg aggggcgccaagaggatggccatcttgggcgacaccgcctgggacttcggcagcgtgg gcggcctgctgaccagcctgggcaaggccgtgcaccaggtgttcggcagcgtgtacac caccatgttcggc 301. 18E11- atgaggtgcgtgggcgtgggcaacagggacttcgtggagggcctgagcggcgccacct NPRM gggtggacgtggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcc Round II caccctggacttcgagctgatcaagaccaccgtcaaggaggtggccctgctgaggacc shuffled tactgcatcgaggccagcatcagcaacatcaccaccgccaccaggtgccccacccagg clone no gcgagccctacctgaaggaggagcaggaccagcagtacatctgcaggagggacgtggt PRM ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgc gccaagttcaagtgcgtgaccaagctggagggcaacatcgtgcagcccgagaacctgg agtacaccatcgtgatcaccccccacaccggcgaccagcaccaggtgggcaacgacac ccagggcgtgaccgtggagatcaccccccaggccagcaccgtggaggccatcctgccc gagtacggcaccctgggcctggagtgcagccccaggactggcctggacttcaacgaga tgatcctgctgaccatgaagaagaagagctggctggtgcacaagcagtggttcctgga cctgcccctgccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaag gagaggatggtgaccttcaagaccgcccacgccaagaagcaggacgtggtggtgctgg gcagccaggagggcgccatgcacaccgccctgaccggcgccaccgaggtggacagcgg cgacggcaacctgctgttcaccggccacctgaagtgcaggctgaagatggacaagctg cagctgaagggcatgagctacagcatgtgcaccggcaagttccagatcgtgaaggaga tcgccgagacccagcacggcaccatcctgatcaaggtggagtacaagggcgaggacgc agcctgcaagatccccttcagcaccgaggacgagaagggcaaggcccacaacggcagg ctgatcaccgccaaccccatcgtgatcgacaaggagaagcccgtgaacatcgaggccg agccccccttcggcgagagctacatcgtggtgggcgccggcgacaaggccctgaagct gagctggttcaagaagggcagcagcatcggccagatgttcgagagcacctacaggggc gccaagaggatggccatcctgggcgagaccgcctgggacttcggcagcgtgggcggcg tgttcaccagcatcggcaaggccctgcaccaggtgttcggcgccatctacggcgcc 302. 11B1-NPRM atgaggtgcatcggcatcagcaacagggacttcgtggagggcgtgagcggcgccacct Round II gggtggacgtggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcc shuffled caccctggacatcgagctgatcaagaccgaggccacccagcccgccaccctgaggaag clone no tactgcatcgaggccaagctgaccaataccaccaccgccaccaggtgccccacccagg PRM gcgaggccatcctgcccgaggagcaggaccagaactacgtgtgcaagcacacctacgt ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcgtggtgacctgc gccaagttccagtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctga agtacaccgtgatcatcaccgtgcacaccggcgaccagcaccaggtgggcaacgatac ccagggcgtgaccgtggagatcaccccccaggccagcaccgtggaggccatcctgccc gagtacggcaccctgggcctggagtgtagccccaggaccggcctggacttcaacgaga tgatcctgctgaccatgaagaacaaggcctggatggtgcacaggcagtggttcttcga cctgcccctgccctggaccagcggcgccaccaccgagacccccacctggaaccacaag gagaggatggtgaccttcaaggtgccccacgccaagaggcaggacgtgaccgtgctgg gcagccaggagggcgccatgcacaccgccctgaccggcgccaccgaggtggacagcgg cgacggcaaccacatgttcgccggccacctgaagtgcaggctgaagatggacaagctg cagctgaagggcatgagctacagcatgtgcaccggcaagttccagatcgtgaaggaga tcgccgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcgacggcag cccctgcaagatccccctggagatcatggacctggagaagaggcacgtgctgggcagg ctgatcaccgccaaccccatcgtgaccgagaaggacagccccgtgaacatcgaggccg agccccccttcggcgacagctacatcatcatcggcgtggagcccggccagctgaagct gcactggttcaggaagggcagcagcatcggccagatgttcgagagcacctacaggggc gccaagaggatggccatcctgggcgagaccgcctgggacttcggcagcgtgggcggcc agtgctgaccagcctgggcaaggccgtgcaccaggtgttcggcgccatctacggcgcc 303. 11B8-NPRM atgaggtgcgtgggcgtgggcaacagggacttcgtggagggcctgagcggcgccacct Round II gggtggacgtggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcc shuffled caccctggacatcgagctgctgaagaccgaggtgaccaaccccgccaccctgaggaag clone no tactgcatcgaggccaagctgaccaataccaccaccgagagcaggtgccccacccagg PPM gcgagccctacctgaaggaggagcaggaccagaactacgtgtgcaagcacacctacgt ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcgtggtgacctgc gccaagttccagtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctga agtacaccgtgatcatcaccgtgcacaccggcgaccagcaccaggtgggcaacgacac ccagggcgtgaccgtggagatcaccccccaggccagcaccgtggaggccatcctgccc gagtacggcaccctgggcctggagtgcagccccaggaccggcctggacttcaacaggg tggtgctgctgaccatgaagaagaagacctggctggtgcacaagcagtggttcctgga cctgcccctgccctggaccgccggcgccgacaccgagacccccacctggaacaggaag gagctgctggtgaccttcaagaacgcccacgccaagaagcaggaggtggtggtgctgg gcagccaggagggcgccatgcacagcgccctggccggcgccaccgagatccagatgag cagcggcaaccacatgttcgccggccacctgaagtgcaaggtgaggatggagaagctg cagetgaagggcatgagctacgtgatgtgcaccggcaagttccagatcgtgaaggaga tcgccgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcgacggcgc cccctgcaagatccccttcagcaccgaggacggccagggcaaggcccacaacggcagg ctgatcaccgccaaccccatcgtgatcgacaaggagaagcccgtgaacatcgagctgg agccccccttcggcgacagctacatcgtggtgggcgccggcgacaaggccctgaagat caactggtacaagaagggcagcagcatcggccagatgttcgaggccaccgccaggggc gccaggaggatggccatcctgggcgacaccgcctgggacttcggcagcgtgggcggcc tgctgaccagcctgggcaagatggtgcaccagatcttcggcagcgtgtacaccaccat gttcggc 304. 11C4-NPRM atgaggtgcgtgggcatcggcaaccgcgacttcgtggagggcctgagcggcgccacct Round II gggtggacgtggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcc shuffled caccctggacatcgagctgctgaagaccgaggtgaccaaccccgccaccctgaggacc clone no tactgcatcgaggccagcatcagcaacatcaccaccgagagcaggtgccccacccagg PPM gcgagccctacctgaaggaggagcaggaccagaactacgtgtgcaagcacacctacgt ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcgtggtgacctgc gccaagttccagtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctga agtacaccatcgtgatcaccccccacagcggcgaggagcacgccgtgggcaacgacac tggcaagcacggcaaggagatcaagatcaccccccagagcagcatcaccgaggccgag ctgaccggctacggcaccgtgaccatggagtgcagccccaggaccggcctggacttca acgagatgatcctgctgaccatgaagagcaagacctggctggtgcacaagcagtggtt cctggacctgcccctgccctggaccgccggcgccgacaccagccaggagacctggaac aggaaggagctgctggtgaccttcaagaacgcccacgccaagaagcaggaggtggtgg tgctgggcagccaggagggcgccatgcacaccgccctgaccggcgccaccgaggtgga cagcggcgacggcaacctgctgttcaccggccacctgaagtgcaggctgaagatggac aagctgcagctgaagggcatgagctacagcatgtgcaccggcaagttccagatcgtga aggagatcgccgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcga cggcagcccctgcaagatccccctggagatcatggacctggagaagaggcacgtgctg ggcaggctgatcaccgtgaaccccatcgtgaccgagaaggacagccccgtgaacatcg aggccgagccccccttcggcgacagctacatcatcatcggcgtgggcgacaaggccct gaagctgcactggttcaagaagggcagcagcatcggcaagatgttcgaggccaccgcc aggggcgccaagaggatggccatcctgggcgagaccgcctgggacttcggcagcgcct acacc 305. 11C11- atgaggtgcgtgggcgtgggcaacagggacttcgtggagggcctgagcggcgccacct NPRM gggtggacgtggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcc Round II caccctggacatcgagctgctgaagaccgaggtgaccaaccccgccgtgctgaggaag shuffled ctgtgcatcgaggccagcatcagcaacatcaccaccgagagcaggtgccccacccagg clone no gcgagccctacctgaaggaggagcaggaccagaactacgtgtgcaagcacacctacgt PRM ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgc gccaagttcaagtgcgtgaccaagctggagggcaagatcgtgcagtacgagaacctga agtacagcgtgatcgtgaccgtgcacaccggcgaccagcaccaggtgggcaacgagac ccagggcgtgaccgtggagatcaccccccaggccagcaccgtggaggccatcctgccc gagtacggcaccctgggcctggagtgcagccccaggaccggcctggacttcaacgaga tgatcctgctgaccatgaagaagaagacctggctggtgcacaagcagtggttcctgga cctgcccctgccctggaccgccggcgccgacaccagccaggagacctggaacaggaag gagctgctggtgaccttcaagaacgcccacgccaagaagcaggaggtggtggtgctgg gcagccaggagggcgccatgcacaccgccctgaccggcgccaccgaggtggacagcgg cgacggcaacctgctgttcaccggccacctgaagtgcaggctgaagatggacaagctg cagctgaagggcatgagctacagcatgtgcaccggcaagttccagatcgtgaaggaga tcgccgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcgaggacgc cccctgcaagatccccttcagcaccgaggacggccagggcaaggcccacaacggcagg ctgatcaccgccaaccccatcgtgaccgagaaggacagccccgtgaacatcgaggccg agccccccttcggcgacagctacatcgtggtgggcgccggcgagaaggccctgaagct gcactggttcaagaagggcagcagcatcggccagatgttcgaggccaccgccaggggc gccaggaggatggccatcctgggcgacaccgcctgggacttcggcagcgtgggcggcg tgttcaccagcatcggcaaggccctgcaccaggtgttcggcgccatctacggcgcc 306. 2H-NPRM atgaggtgcatcggcatcagcaacagggacttcgtggagggcgtgagcggcgccacct Round II gggtggacgtggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcc shuffled caccctggacatcgagctgatcaagaccgaggccacccagcccgccaccctgaggaag clone no tactgcatcgaggccagcatcagcaacatcaccaccgacagcaggtgccccacccagg PRM gcgagccctacctgaaggaggagcaggaccagcagtacatctgcaggagggacgtggt ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgc gccaagttcaagtgcgtgaccaagctggagggcaagatcgtgcagtacgagaacctga agtacagcgtgatcgtgaccgtgcacaccggcgaccagcaccaggtgggcaacgacac ccagggcgtgaccgtggagatcaccccccaggccagcaccgtggaggccatcctgccc gagtacggcaccctgggcctggagtgcagccccaggaccggcctggacttcaacgaga tgatcctgctgaccatgaagaagaagagctggctggtgcacaagcagtggttcctgga cctgcccctgccctggaccgccggcgccgacaccgagacccccacctggaacaggaag gagctgctggtgaccttcaagaacgcccacgccaagaagcaggaggtggtggtgctgg gcagccaggagggcgccatgcacaccgccctgaccggcgccaccgagatccagatgag cagcggcaacctgctgttcaccggccacctgaagtgcaggctgaagatggacaagctg cagctgaagggcatgagctacagcatgtgcaccggcaagttccagatcgtgaaggaga tcgccgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcgacggcgc cccctgcaagatccccttcagcaccgaggacgagaagggcgtgacccagaacggcagg ctgatcaccgccaaccccatcgtgaccgagaaggacagccccgtgaacatcgaggccg agccccccttcggcgacagctacatcatcatcggtgtggagcccggccagctgaagct gcactggttcaagaagggcagcagcatcggccagatgttcgaggccaccgccaggggc ggccaggaggatggccatcctgggcgacaccgcctgggacttcggcagcgcctacac 307. 13F11- atgaggtgcgtgggcatcggcaaccgcgacttcgtggagggcgtgagcggcggcgcct NPRM gggtggacctggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcc Round II caccctggacatcgagctgatcaagaccgaggccacccagcccgccaccctgaggaag shuffled tactgcatcgaggccaagatcaccaacatcaccaccgacagcaggtgccccacccagg clone no gcgagccctacctgaaggaggagcaggaccagcagtacatctgcaggagggacgtggt PRM ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgc gccaagttcaagtgcgtgaccaagctggagggcaagatcgtgcagtacgagaacctga agtacagcgtgatcgtgaccgtgcacaccggcgaccagcaccaggtgggcaacgagac caccgagcacggcaccatcgccaccatcaccccccaggcccccaccagcgagatccag ctgaccgactacggcgccctgaccctggactgcagccccaggaccggcctggacttca acagggtggtgctgctgaccatgaagaagaagacctggctggtgcacaagcagtggtt cctggacctgcccctgccctggaccgccggcgccgacaccagcgaggtgcactggaac cacaaggagaggatggtgaccttcaaggtgccccacgccaagaggcaggacgtgaccg tgctgggcagccaggagggcgccatgcacagcgccctggccggcgccaccgaggtgga cagcggcgacggcaaccacatgttcgccggccacctgaagtgcaggctgaagatggac aagctgcagctgaagggcatgagctacagcatgtgcaccggcaagttccagatcgtga aggagatcgccgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcga cggcgccccctgcaagatccccttcagcaccgaggacgagaagggcaaggcccacaac ggcaggctgatcaccgccaaccccatcgtgatcgacaaggagaagcccgtgaacatcg agctggagccccccttcggcgagagctacatcgtggtgggcgccggcgagaaggccct gaagctgagctggttcaagaagggcagcagcatcggcaagatgttcgagagcacctac aggggcgccaagaggatggccatcctgggcgacaccgcctgggacttcggcagcgtgg gcggcgtgttcaccagcatcggcaaggccctgcaccaggtgttcggcgccatctacgg cgcc 308. 14B1-NPRM atgaggtgcgtgggcatcagcaacagggacttcgtggagggcgtgagcggcggcgcct Round II gggtggacctggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcc shuffled caccctggacatcgagctgctgaagaccgaggtgaccaaccccgccaccctgaggaag clone no tactgcatcgaggccaagatcaccaacatcaccaccgacagcaggtgccccacccagg PRM gcgagccctacctgaaggaggagcaggaccagcagtacatctgcaggagggacgtggt ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgc gccaagttcaagtgcgtgaccaagctggagggcaacatcgtgcagcccgagaacctga agtacaccatcgtgatcaccccccacaccggcgaccagcaccaggtgggcaacgacac ccagggcgtgaccgtggagatcaccccccaggccagcaccgtggaggccatcctgccc gagtacggcaccctgggcctggagtgcagccccaggaccggcctggacttcaacgaga tgatcctgctgaccatgaagaagaagagctggctggtgcacaagcagtggttcctgga cctgcccctgccctggaccgccggcgccgacaccgagacccccacctggaacaggaag gagctgctggtgaccttcaagaacgcccacgccaagaggcaggacgtgaccgtgctgg gcagccaggagggcgccatgcacagcgccctgaccggcgccaccgaggtggacagcgg cgacggcaacctgctgttcaccggccacctgaagtgcaggctgaagatggacaagctg cagctgaagggcatgagctacaccatgtgcaccggcaagttccagatcgtgaaggaga tcgccgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcgacggcgc cccctgcaagatccccttcagcaccgaggacgagaagggcgtgacccagaacggcagg ctgatcaccgccaaccccatcgtgaccgagaaggacagccccgtgaacatcgaggccg agccccccttcggcgagagctacatcgtggtgggcgccggcgagaaggccctgaagct gagctggttcaagaagggcagcagcatcggccagatgttcgagagcaccgccaggggc gccaggaggatggccatcctgggcgacaccgcctgggacttcggcagcgcctacacc 309. 14G10- atgaggtgcatcggcatcagcaacagggacttcgtggagggcgtgagcggcgccacct NPRM gggtggacgtggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcc Round II caccctggacatcgagctgctgaagaccgaggtgaccaaccccgccaccctgaggaag shuffled tactgcatcgaggccaagatcaccaacatcaccaccgacaccaggtgccccacccagg clone no gcgaggccatcctgcccgaggagcaggaccagcagtacatctgcaggagggacgtggt PRM ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcctggtgacctgc gccaagttcacctgcaagaagaacatggagggcaacatcgtgcagcccgagaacctgg agtacaccatcgtgatcaccccccacagcggcgaggagcacgccgtgggcaacgagac caccgagcacggcaccatcgccaccatcaccccccaggcccccaccagcgagatccag ctgaccgactacggcgccctgaccctggactgcagccccaggaccggcctggacttca acagggtggtgctgctgaccatgaagaagaagacctggctggtgcacaagcagtggtt cctggacctgcccctgccctggaccgccggcgccgacaccagcgaggtgcactggaac cacaaggagaggatggtgaccttcaaggtgccccacgccaagaggcaggacgtgaccg tgctgggcagccaggagggcgccatgcacagcgccctggccggcgccaccgagatcca gatgagcagcggcaacctgctgttcaccggccacctgaagtgcaggctgaagatggac aagctgcagctgaagggcatgagctacagcatgtgcaccggcaagttccagatcgtga aggagatcgccgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcga cggcgccccctgcaagatccccttcagcaccgaggacggccagggcaaggcccacaac ggcaggctgatcaccgccaaccccatcgtgatcgacaaggagaagcccgtgaacatcg agctggagccccccttcggcgacagctacatcgtggtgggcgccggcgacaaggccct gaagatcaactggtacaagaagggcagcagcatcggcaagatgttcgagagcacctac aggggcgccaagaggatggccatcctgggcgagaccgcctgggacttcggcagcgtgg gcggcgtgttcaccagcatcggcaaggccctgcaccaggtgttcggcgccatctacgg cgcc 310. 14H2-NPRM atgaggtgcgtgggcgtgggcaacagggacttcgtggagggcgtgagcggcggcgcct Round II gggtggacctggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcc shuffled caccctggacttcgagctgatcaagaccaccgccaaggaggtggccctgctgaggacc clone no tactgcatcgaggccaagatcaccaacatcaccaccgccaccaggtgccccacccagg PRM gcgagccctacctgaaggaggagcaggaccagaactacgtgtgcaagcacacctacgt ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgc gccaagttccagtgcgtgaccaagctggagggcaacatcgtgcagcccgagaacctgg agtacaccatcgtgatcaccccccacagcggcgaggagcacgccgtgggcaacgacac ccagggcgtgaccgtggagatcaccccccaggccagcaccgtggaggccatcctgacc gactacggcgccctgaccctggactgcagccccaggaccggcctggacttcaacaggg tggtgctgctgaccatgaagaagaagacctggctggtgcacaagcagtggttcctgga cctgcccctgccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaag gagaggatggtgaccttcaaggtgccccacgccaagaagcaggaggtggtggtgctgg gcagccaggagggcgccatgcacaccgccctgaccggcgccaccgaggtggacagcgg cgacggcaacctgctgttcaccggccacctgaagtgcaggctgaagatggacaagctg cagctgaagggcatgagctacagcatgtgcaccggcaagttccagatcgtgaaggaga tcgccgagacccagcacggcaccatcgtgatcagggtgcagtacaagggcgaggacgc cccctgcaagatccccttcagcagccaggacggccagggcaaggcccacaacggcagg ctgatcaccgccaaccccatcgtgaccgagaaggacagccccgtgaacatcgaggccg agccccccttcggcgagagctacatcgtggtgggcgccggcgagaaggccctgaagct gagctggttcaagaagggcagcagcatcggccagatgttcgagaccaccatgaggggc gccaagaggatggccatcctgggcgagaccgcctgggacttcggcagcgtgggcggcg tgttcaccagcatcggcaaggccctgcaccaggtgttcggcgccatctacggcgcc 311. 15C2-NPRM atgaggtgcgtcggcatcagcaacagggacttcgtggagggcctgagcggcgccacct Round II gggtggacgtggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcc shuffled caccctggacatcgagctgctgaagaccgaggtgaccaaccccgccaccctgaggaag clone no tactgcatcgaggccaagatcaccaacatcaccaccgacagcaggtgccccacccagg PRM gcgagccctacctgaaggaggagcaggaccagcagtacatctgcaggagggacgtggt ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcgtggtgacctgc gccaagttccagtgcctggagcccatcgagggcaagatcgtgcagtacgagaacctga agtacagcgtgatcgtgaccgtgcacaccggcgaccagcaccaggtgggcaacgacac ccagggcgtgaccgtggagatcaccccccaggccagcaccgtggaggccatcctgccc gagtacggcaccctgggcctggagtgcagccccaggactggcctggacttcaacgaga tgatcctgctgaccatgaagaacaaggcctggatggtgcacaggcagtggttcttcga cctgcccctgccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaag gagaggatggtgaccttcaaggtgccccacgccaagaggcaggacgtgaccgtgctgg gcagccaggagggcgccatgcacagcgccctggccggcgccaccgaggtggacagcgg cgacggcaaccacatgttcgccggccacctgaagtgcaggctgaagatggacaagctg cagctgaagggcatgagctacgtgatgtgcaccggcaagttccagatcgtgaaggaga tcgccgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcgacggcgc cccctgcaagatccccttcagcaccgaggacgagaagggcgtgacccagaacggcagg ctgatcaccgccaaccccatcgtgatcgacaaggagaagcccgtgaacatcgaggccg agccccccttcggcgagagctacatcgtcatcggcgtggagcccggccagctgaagct gcactggttcaagaagggcagcagcatcggccagatgttcgaggccaccgccaggggc gccaggaggatggccatcctgggcgagaccgcctgggacttcggcagcgtgggcggcc tgctgaccagcctgggcaaggccgtgcaccaggtgttcggcagcgtgtacaccaccat gttcggc 312. 15D4-NPRM atgaggtgcgtgggcatcggcaaccgcgacttcgtggagggcctgagcggcgccacct Round II gggtggacgtggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcc shuffled caccctggacttcgagctgatcaagaccaccgccaaggaggtggccctgctgaggacc clone no tactgcatcgaggccaagatcaccaacatcaccaccgacagcaggtgccccacccagg PRM gcgaggccatcctgcccgaggagcaggaccagaactacgtgtgcaagcacacctacgt ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgc gccaagttcaagtgcctggagcccatcgagggcaaggtggtgcagcccgagaacctgg agtacaccatcgtgatcaccccccacagcggcgaggagcacgccgtgggcaacgacac tggcaagcacggcaaggagatcaagatcaccccccagagcagcatcaccgaggccgag ctgaccggctacggcaccgtgaccatggagtgcagccccaggaccggcctggacttca acagggtggtgctgctgaccatgaagaagaagagctggctggtgcacaagcagtggtt cctggacctgcccctgccctggaccgccggcgccgacaccagcgaggtgcactggaac cacaaggagaggatggtgaccttcaagaccgcccacgccaagaagcaggacgtggtgg tgctgggcagccaggagggcgccatgcacagcgccctggccggcgccaccgagatcca gatgagcagcggcaacctgctgttcaccggccacctgaagtgcaggctgaggatggac aagctgaccctgaagggcgtgagctacgtgatgtgcaccggcagcttcaagctggaga aggaggtggccgagacccagcacggcaccgtgctggtgcaggtgaagtacgagggcac cgacgccccctgcaagatccccttcagcaccgaggacgagaagggcgtgacccagaac ggcaggctgatcaccgccaaccccatcgtgaccgagaaggacagccccgtgaacatcg aggccgagccccccttcggcgagagctacatcgtggtgggcgccggcgacaaggccct gaagatcaactggtacaagaagggcagcagcatcggcaagatgttcgagagcacctac aggggcgccaagaggatggccatcttgggcgacaccgcctgggacttcggcagcctgg gcggcgtgttcaccagcatcggcaaggccctgcaccaagtgttcggcgccatctacgg cgcc 313. 15H4-NPRM atgaggtgcgtgggcgtgggcaacagggacttcgtggagggcctgagcggcgccacct Round II gggtggacgtggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcc shuffled caccctggacttcgagctgatcaagaccgaggtgaccaaccccgccaccctgaggaag clone no tactgcatcgaggccagcatcagcaacatcaccaccgccaccaggtgccccacccagg PRM gcgaggccaacctgcccgaggagcaggaccagaactacgtgtgcaagcacacctacgt ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcctggtgacctgc gccaagttccagtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctga agtacaccgtgatcatcaccgtgcacaccggcgaccagcaccaggtgggcaacgacac ccagggcgtgaccgtggagatcaccccccaggccagcaccgtggaggccatcctgccc gagtacggcaccctgggcctggagtgcagccccaggaccggcctggacttcaacaggg tggtgctgctgaccatgaagaagaagagctggctggtgcacaagcagtggttcctgga cctgcccctgccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaag gagaggatggtgaccttcaagaccgcccacgccaagaagcaggaggtggtggtgctgg gcagccaggagggcgccatgcacagcgccctggccggcgccaccgagatccagatgag cagcggcaacctgctgttcaccggccgcctgaagtgcaggctgaagatggacaagctg cagctgaagggcatgagctacgtgatgtgcaccggcaagttccagatcgtgaaggaga tcgccgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcgacggcgc cccctgcaagatccccctggagatcatggacctggagaagaggcacgtgctgggcagg ctgatcaccgtgaaccccatcgtgatcgacaaggagaagcccgtgaacatcgaggccg agcccccctttggcgacagctacatcatcatcggcgtggagcccggccagctgaagct gcactggttcaagaagggcagcagcatcggccagatgttcgagagcacctacaggggc gccaagaggatggccatcctgggcgacaccgcctgggacttcggcagcgtgggcggcg tgctgaacagcctgggcaagatggtgcaccagatcttcggcagcgtgtacaccaccat gttcggc 314. 16B4-NPRM atgaggtgcgtgggcgtgggcaacagggacttcgtggagggcctgagcggcgccacct Round II gggtggacgtggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcc shuffled caccctggacatcgagctgctgaagaccgaggtgaccaaccccgccgtgctgaggaag clone no ctgtgcatcgaggccagcatcagcaacatcaccaccgacagcaggtgccccacccagg PRM gcgaggccatcctgcccgaggagcaggaccagaactacgtgtgcaagcacacctacgt ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgc gccaagttcaagtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctga agtacaccgtgatcatcaccgtgcacaccggcgaccagcaccaggtgggcaacgacac ccagggcgtgaccgtggagatcaccccccaggccagcaccgtggaggccatcctgccc gagtacggcaccctgggcctggagtgcagccccaggaccggcctggacttcaacgaga tgatcctgctgaccatgaagaagaagacctggctggtgcacaagcagtggttcctcga cctgcccctgccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaag gagaggatggtgaccttcaagaccgcccacgccaagaagcaggacgtggtggtgctgg gcagccaggagggcgccatgcacaccgccctgaccggcgccaccgagatccagatgag cagcggcaccaccaccatcttcgccggccacctgaagtgcaggctgaagatggacaag ctgcagctgaagggcatgagctacaccatgtgcaccggcaagttccagatcgtgaagg agatcgccgagacccagcacggcaccatcctgatcaaggtggagtacaagggcgagga cgccccctgcaagatccccttcagcaccgaggacgagaagggcgtgacccagaacggc aggctgatcaccgccaaccccatcgtgaccgagaaggacagccccgtgaacatcgagg ccgagccccccttcggcgagagctacatcgtggtcggcgtgggcgacaaggccctgaa gatcaactggtacaagaagggcagcagcatcggccagatgttcgagaccaccatgagg ggcgccaagaggatggccatcctgggcgacaccgcctgggacttcggcagcgtgggcg gcctgctgaccagcctgggcaagatggtgcaccagatcttcggcagcgtgtacaccac catgttcggc 315. 16F12- atgaggtgcgtgggcgtgggcaacggggacttcgtggagggcctgagcggcgccacct NPRM gggtggacgtggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcc Round II caccctggacttcgagctgatcaagaccatcgccaaggaggtggccctgctgaggacc shuffled tactgcatcgaggccagcatcagcaacatcaccaccgacagcaggtgccccacccagg clone no gcgaggccatcctgcccgaggagcaggaccagaactacgtgtgcaagcacacctacgt PRM ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcgtggtgacctgc gccaagttccagtgcctggagcccatcgagggcaaggtggtgcagcccgagaacctgg agtacaccatcgtgatcaccccccacaccggcgaccagcaccaggtgggcaacgacac ccagggcgtgaccgtggagatcaccccccaggccagcaccgtggaggccatcctgccc gagtacggcaccctgggcctggagtgcagccccaggaccggcctggacttcaacgaga tgatcctgctgaccatgaagaacaaggcctggatggtgcacaggcagtggttcttcga cctgcccctgccctggaccagcggcgccaccaccgaggtgcactggaaccacaaggag aggatggtgaccttcaagaccgcccacgccaagaagcaggacgtggtggtgctgggca gccaggagggcaccatgcacaccgccctgaccggcgccaccgagatccagatgagcag cggcaccaccaccatcttcgccggccacctgaagtgcaaggtgaagatggacaagctg cagctgaagggcatgagctacaccatgtgcaccggcaagttccagatcgtgaaggaga tcgccgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcgacggcgc cccctgcaagatccccctggagatcatggacctggagaagaggcacgtgctgggcagg ctgatcaccgtgaaccccatcgtgatcgacaaggagaagcccgtgaacatcgaggccg agccccccttcggcgagagctacatcgtggtcggcgtgggcgacagcgccctgaagat caactggtacaagaagggcagcagcatcggcaagatgttcgagagcacctacaggggc gccaagaggatggccatcctgggcgagaccgcctgggacttcggcagcgtgggcggcg tgttcaccagcatcggcaaggccctgcaccaggtgttcggcgccatctacggcgcc 316. 16G11- atgaggtgcgtgggcatcggcaaccgcgacttcgtggagggcctgagcggcgccacct NPRM gggtggacgtggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcc Round II caccctggacatcgagctgctgaagaccgaggtgaccaaccccgccgtgctgaggaag shuffled ctgtgcatcgaggccagcatcagcaacaccaccaccgacagcaggtgccccacccagg clone no gcgaggccaccctggtggaggagcaggacaccaacttcgtgtgcaggaggaccttcgt PRM ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcgtggtgacctgc gccaagttccagtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctga agtacaccgtgatcatcaccgtgcacaccggcgaccagcaccaggtgggcaacgacac ccagggcgtgaccgtggagatcaccccccaggccagcaccgtggaggccatcctgccc gagtacggcaccctgggcctggagtgcagccccaggaccggcctggacttcaacgaga tgatcctgctgaccatgaagaacaaggcctggctggtgcacaagcagtggttcctgga aggagatcgccgagacccagcacggcaccatcgtgatcaaggtggagtacaagggcga ggacgccccctgcaagatccccttcagcaccgaggacggccagggcaaggcccacaac ggcaggctgatcaccgccaaccccatcgtgatcgacaaggagaagcccgtgaacatcg aggccgagccccccttcggcgagagctacatcgtggtgggcgccggcgagaaggccct gaagctgagctggttcaagaagggcagcagcatcggccagatgttcgagaccaccatg aggggcgccaagaggatggccatcctgggcgacaccgcctgggacttcggcagcgtgg gcggcctgctgaccagcctgggcaagatggtgcaccagatcttcggcagcgtgtacac caccatgttcggc 326. 18A9-NPRM atgaggtgcgtgggcatcggcaaccgcgacttcgtggagggcctgagcggcgccacct Round II gggtggacgtggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcc shuffled caccctggacatcgagctgctgaagaccgaggtgaccaaccccgccaccctgaggaag clone no tactgcatcgaggccaagatcaccaacatcaccaccgccaccaggtgccccacccagg PRM gcgaggccatcctgcccgaggagcaggaccagaactacgtgtgcaagcacagcatggt ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcgtggtgacctgc gccaagttccagtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctga agtacaccgtgatcatcaccgtgcacaccggcgaccagcaccaggtgggcaacgacac ccagggcgtgaccgtggagatcaccccccaggccagcaccgtggaggccatcctgccc gagtacggcaccctgggcctggagtgcagccccaggactggcctggacttcaacgaga tgatcctgctgaccatgaagaacaaggcctggatggtgcacaagcagtggttcctgga cctgcccctgccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaag gagaggatggtgaccttcaaggtgccccacgccaagaggcaggacgtgaccgtgctgg gcagccaggagggcgccatgcacagcgccctggccggcgccaccgagatccagatgag cagcggcaacctgctgttcaccggccacctgaagtgcaggctgaggatggacaagctg accctgaagggcgtgagctacgtgatgtgcaccggcaagttccagatcgtgaaggaga tcgccgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcaccgacgc cccctgcaagatccccttcagcagccaggacgagaagggcgtgacccagaacggcagg ctgatcaccgccaaccccatcgtgatcgacaaggagaagcccgtgaacatcgagctgg agccccccttcggcgacagctacatcgtggtgggcgccggcgacaaggccctgaccct gcactggttcaggaagggcagcagcatcggccagatgttcgagaccaccatgaggggc gccaagaggatggccatcctgggcgacaccgcctgggacttcggcagcgtgggcggcc tgctgaccagcctgggcaagatggtgcaccagatcttcggcagcgtgttcaccagcgt gggcaag 327. 18B7-NPRM atgaggtgcatcggcatcagcaacagggacttcgtggagggcctgagcggcgccacct Round II gggtggacgtggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcc shuffled caccctggacttcgagctgatcaagaccaccgccaaggaggtggccctgctgaggacc clone no tactgcatcgaggccagcatcagcaacatcaccaccgccaccaggtgccccacccagg PRM gcgagccctacctgaaggaggagcaggaccagcagtacatctgcaggagggacgtggt ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcgtggtgacctgc gccaagttccagtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctga agtacaccgtgatcatcaccgtgcacaccggcgaccagcaccaggtgggcaacgacac ccagggcgtgaccgtggagatcaccccccaggccagcaccgtggaggccatcctgccc gagtacggcaccctgggcctggagtgcagccccaggaccggcctggacttcaacgaga tgatcctgctgaccatgaagaacaaggcctggatggtgcacaagcagtggttcctgga cctgcccctgccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaag gagaggatggtgaccttcaagaccgcccacgccaagaagcaggacgtggtggtgctgg gcagccaggagggcgccatgcacaccgccctgaccggcgccaccgaggtggacagcgg cgacggcaacctgctgttcaccggccacctgaagtgcaggctgaggatggacaagctg cagctgaagggcatgagctacagcatgtgcaccggcaagttccagatcgtgaaggaga tcgccgagacccagcacggcaccatcctgatcaaggtggagtacaagggcgaggacgc cccctgcaagatccccttcagcaccgaggacgagaagggcgtgacccagaacggcagg ctgatcaccgccaaccccatcgtgatcgacaaggagaagcccgtgaacatcgaggccg agccccccttcggcgacagctacatcgtggtgggcgccggcgacaaggccctgaagat caactggtacaagaagggcagcagcatcggcaagatgttcgagagcacctacaggggc gccaagaggatggccatcctgggcgagaccgcctgggacttcggcagcgtgggcggcc cctgcccctgccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaag gagaggatggtgaccttcaagaccgcccacgccaagaagcaggacgtggtggtgctgg gcagccaggagggcgccatgcacaccgccctgaccggcgccaccgagatccagatgag cagcggcaacctgctgttcaccggccacctgaagtgcaggctgaggatggacaagctg cagctgaagggcatgagctacgtgatgtgcaccggcaagttccagatcgtgaaggaga tcgccgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcgacggcgc cccctgcaagatccccttcagcaccgaggacggccagggcaaggcccacaacggcagg ctgatcaccgccaaccccatcgtgatcgacaaggagaagcccgtgaacatcgagctgg agccccccttcggcgacagctacatcgtggtgggcgccggcgacaaggccctgaagat caactggtacaagaagggcagcagcatcggcaagatgttcgaggccaccgccaggggc gccaggaggatggccatcctgggcgacaccgcctgggacttcggcagcctgggcggcc tgctgaccagcctgggcaaggccgtgcaccaggtgttcggcagcgtgtacaccaccat gttcggc 317. 17A12- atgaggtgcgtgggcgtgggcaacagggacttcgtggagggcctgagcggcgccacct NPRM gggtggacgtggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcc Round II caccctggacttcgagctgatcaagaccgaggccacccagcccgccaccctgaggaag shuffled tactgcatcgaggccaagatcaccaacatcaccaccgacagcaggtgccccacccagg clone no gcgaggccatcctgcccgaggagcaggaccagaactacgtgtgcaagcacacctacgt PRM ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgc gccaagttcaagtgcctggagcccatcgagggcaaggtggtgcagcccgagaacctgg agtacaccatcgtgatcaccccccacaccggcgaccagcaccaggtgggcaacgacac ccagggcgtgaccgtggagatcaccccccaggccagcaccgtggaggccatcctgccc gagtacggcaccctgggcctggagtgcagccccaggactggcctggacttcaacgaga tgatcctgctgaccatgaagaagaagagctggctggtgcacaagcagtggttcctgga cctgcccctgccctggaccagcggcgccaccaccgagacccccacctggaacaggaag gagctgctggtgaccttcaagaacgcccacgccaagaggcaggacgtgaccgtgctgg gcagccaggagggcgccatgcacagcgccctgaccggcgccaccgagatccagaccag cggcaccaccaccatcttcgccggccacctgaagtgcagggtgaggatggacaagctg cagctgaagggcatgagctacgtgatgtgcaccggcagcttcaagctggagaaggagg tggccgagacccagcacggcaccgtgctggtgcaggtgaagtacgagggcaccgacgc cccctgcaagatccccgtcagcagccaggacgagaagggcgtgacccagaacggcagg ctgatcaccgccaaccccatcgtgaccgagaaggacagccccgtgaacatcgaggccg agccccccttcggcgacagctacatcatcatcggcgtggagcccggccagctgaagct gcactggttcaagaagggcagcagcatcggccagatgttcgagaccaccatgaggggc gccaagaggatggccatcctgggcgacaccgcctgggacctcggcagcgtgggcggcg tgttcaccagcatcggcaaggccctgcaccaggtgttcggcgccatctacggcgcc 318. 17D5-NPRM atgaggtgcatcggcatcagcaacagggacttcgtggagggcgtgagcggcgccacct Round II gggtggacgtggtgctggagcacggcggctgcgtgaccaccatggccaggaacaagcc shuffled caccctggacatcgagctgatcaagaccgaggccacccagcccgccaccctgaggaag clone no tactgcatcgaggccagcatcagcaacatcaccaccgccaccaggtgccccacccagg PRM gcgaggccatcctgcccgaggagcaggaccagcagtacatctgcaggagggacgtggt ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcgtggtgacctgc gccaagttccagtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctga agtacaccgtgatcatcaccgtgcacaccggcgaccagcaccaggtgggcaacgacac ccagggcgtgaccgtggagatcaccccccaggcccccaccagcgagatccagctgacc gactacggcgccctgaccctggactgcagccccaggaccggcctggacttcaacaggg tggtgctgctgaccatgaagaagaagacctggctggtgcacaagcagtggtccctgga cctgcccctgccctggaccagcggcgccaccaccgagacccccacctggaacaggaag gagctgctggtgaccttcaagaacgcccacgccaagaagcaggaggtggtggtgctgg gcagccaggagggcgccatgcacaccgccctgaccggcgccaccgaggtggacagcgg cgacggcaacctgctgttcaccggccacctgaagtgcaggctgaagatggacaagctg cagctgaagggcatgagctacaccatgtgcaccggcaagttccagatcgtgaaggaga tcgccgagacccagcacggcaccatcctgatcaaggtggagtacaagggcgaggacgc cccctgcaagatccccttcagcaccgaggacgagaagggcgtgacccagaacggcagg ctgatcaccgccaaccccatcgtgaccgagaaggacagccccgtgaacatcgaggccg agccccccttcggcgagagctacatcgtggtgggcgccggcgacaaggccctgaagat caactggtacaagaagggcagcagcatcggcaagatgttcgagagcacctacaggggc gccaagaggatggccatcttgggcgacaccgcctgggacttcggcagcctgggcggcg tgttcaccagcatcggcaaggccctgcaccaggtgttcggcgccatctacggcgcc 319. 17D11- atgaggtgcgtgggcaccggcaaccgcgacttcgtggagggcctgagcggcgccacct NPRM gggtggacgtggtgctggagcacggcggctgcgtgaccaccatggcccagggcaagcc Round II caccctggacatcgagctgatcaagaccgaggccacccagcccgccaccctgaggaag shuffled tactgcatcgaggccaagctgaccaataccaccaccgagagcaggtgccccacccagg clone no gcgagccctacctgaaggaggagcaggaccagaactacgtgtgcaagcacacctacgt PRM ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcgtggtgacctgc gccaagttccagtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctga agtacaccgtgatcatcaccgtgcacaccggcgaccagcaccaggtgggcaacgacac ccagggcgtgaccgtggagatcaccccccaggccagcaccgtggaggccatcctgccc gagtacggcaccctgggcctggagtgcagccccaggactggcctggacttcaacgaga tgatcctgctgaccatgaagaagaagagctggctggtgcacaagcagtggttcctgga cctgcccctgccctggaccgccggcgccagcaccagccaggagacctggaacaggaag gagctgctggtgaccttcaagaacgcccacgccaagaagcaggaggtggtggtgctgg gcagccaggagggcgccatgcacaccgccctgaccggcgccaccgaggtggacagcgg cgacggcaacctgctgttcaccggccacctgaagtgcaggctgaagatggacaagctg cagctgaagggcatgagctacaccatgtgcaccggcaagttccagatcgtgaaggaga tcgccgagacccagcacggcaccatcctgatcaaggtggagtacaagggcaccgacgc cccctgcaagatccccttcagcagccaggacgagaagggcgtgacccagaacggcagg ctgatcaccgccaaccccatcgtgaccgagaaggacagccccgtgaacatcgaggccg agccccccttcggcgagagctacatcgtggtgggcgccggcgacaaggccctgaagat caactggtacaagaagggcagcagcatcggcaagatgttcgagagcacctacaggggc gccaagaggatggccatcctgggcgagaccgcctgggacttcggcagcgtgggcggcg tgttcaccagcatcggcaaggccctgcaccaggtgttcggcgccatctacggcgcc 320. 17F5-NPRM atgaggtgcgtgggcgtgggcaacagggacttcgtggagggcctgagcggcgccacct Round II gggtggacgtggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcc shuffled caccctggacatcgagctgctgaagaccgaggtgaccaaccccgccaccctgaggaag clone no tactgcatcgaggccaagatcaccaacatcaccaccgacagcaggtgccccacccagg PRM gcgaggccatcctgcccgaggagcaggaccagaactacgtgtgcaagcacacctacgt ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcgtggtgacctgc gccaagttccagtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctga agtacaccgtgatcatcaccgtgcacaccggcgaccagcaccaggtgggcaacgacac ccagggcgtgaccgtggagatcaccccccaggccagcaccgtggaggccatcctgccc gagtacggcaccctgggcctggagtgcagccccaggaccggcctggacttcaacaggg tggtgctgctgaccatgaagaagaagagctggctggtgcacaagcagtggttcctgga cctgcccctgccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaag gagaggatggtgaccttcaagaccgcccacgccaagaagcaggacgtggtggtgctgg gcagccaggagggcgccatgcacagcgccctggccggcgccaccgagatccagatgag cagcggcaacctgctgttcaccggccacctgaagtgcaggctgaggatggacaagctg cagctgaagggcatgagctacgtgatgtgcaccggcaagttccagatcgtgaaggaga tcgccgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcgacgacgc cccctgcaagatccccttcagcaccgaggacgagaagggcgtgacccagaacggcagg ctgatcaccgccaaccccatcgtgaccgagaaggacagccccgtgaacatcgaggccg agccccccttcggcgagagctacatcgtggtcggcgtgggcgacagcgccctgaccct gcactggttcaggaagggcagcagcatcggccagatgttcgagaccaccatgaggggc gccaagaggatggccatcttgggcgacaccgcctgggacttcggcagcctgggcggcg tgttcaccagcctgggcaaggccgtgcaccagatcttcggcagcgtgtacaccaccat gttcggc 321. 17F11- atgaggtgcatcggcatcagcaacagggacttcgtggagggcgtgagcggcgccacct NPRM gggtggacgtggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcc Round II caccctggacttcgagctgatcaagaccaccgccaaggaggtggccctgctgaggacc shuffled tactgcatcgaggccagcatcagcaacatcaccaccgccaccaggtgccccacccagg clone no gcgaggccatcctgcccgaggagcaggaccagaactacgtgtgcaagcacacdtacgt PRM ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgc gccaagttccagtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctga agtacaccgtgatcatcaccgtgcacaccggcgaccagcaccaggtgggcaacgacac ccagggcgtgaccgtggagatcaccccccaggccagcaccgtggaggccatcctgccc gagtacggcaccctgggcctggagtgcagccccaggaccggcctggacttcaacgaga tgatcctgctgaccatgaagaacaaggcctggatggtgcacaagcagtggctcctgga cctgcccctgccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaag gagaggatggtgaccttcaaggtgccccacgccaagaagcaggacgtggtggtgctag gcagccaggagggcgccatgcacaccgccctgaccggcgccaccgagatccagatgag cagcggcaacctgctgttcaccggccacctgaagtgcaggctgaggatggacaagctg cagctgaagggcatgagctacagcatgtgcaccggcaagttccagatcgtgaaggaga tcgccgagacccagcacggcaccatcctgatcaaggtggagtacaagggcgaggacgc cccctgcaagatccccttcagcaccgaggacgagaagggcgtgacccacaacggcagg ctgatcaccgccaaccccatcgtgatcgacaaggagaagcccgtgaacatcgagctgg agccccccttcggcgacagctacatcgtgatcggcgtgggcgacagcgccctgaccct gcactggttcaggaagggcagcagcatcggcaagatgttcgagagcacctacaggggc gccaagaggatggccatcctgggcgagaccgcctgggacttcggcagcgtgggcggcc tgctgaccagcctgggcaaggccgtgcaccaggtgttcggcagcgtgtacaccaccat gttcggc 322. 17G5-NPRM atgaggtgcgtgggcatcggcaaccgcgacttcgtggagggcctgagcggcgccacct Round II gggtggacgtggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcc shuffled caccctggacttcgagctgatcaagaccaccgccaaggaggtggccctgctgaggacc clone no tactgcatcgaggccagcatcagcaacatcaccaccgccaccaggtgccccacccagg PRM gcgaggccatcctgaaggaggagcaggaccagaactacgtgtgcaagcacacctacgt ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcgtggtgacctgc gccaagttccagtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctga agtacaccgtgatcatcaccgtgcacaccggcgaccagcaccaggtgggcaacgacac ccagggcgtgaccgtggagatcaccccccaggccagcaccgtggaggccatcctgccc gagtacggcaccctgggcctggagtgcagccccaggaccggcctggacttcaacgaga tgatcctgctgaccatgaagaacaaggcctggatggtgcacaggcagtggttcttcga cctgcccctgccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaag gagaggatggtgaccttcaagaccgcccacgccaagaagcaggaggtggtggtgctgg gcagccaggagggcgccatgcacagcgccctggccggcgccaccgaggtggacagcgg cgacggcaaccacatgttcgccggccacctgaagtgcaaggtgaggatggacaagctg accctgaagggcgtgagctacgtgatgtgcaccggcagcttcaagctggagaaggaga tcgccgagacccagcacggcaccatcctgatcaaggtggagtacaagggcgaggacgc agcctgcaagatccccttcagcaccgaggacggccagggcaaggcccacaacggcagg ctgatcaccgccaaccccatcgtgatcgacaaggagaagcccgtgaacatcgagctgg agccccccttcggcgacagctacatcgtggtgggcgccggcgacaaggccctgaagat caactggtacaagaagggcagcagcatcggcaagatgttcgagagcacctacaggggc gccaagaggatggccatcctgggcgagaccgcctgggacttcggcagcgtgggcggcg tgttcaccagcatcggcaaggccctgcaccaggtgttcggcgccatctacggcgcc 323. 17H3-NPRM atgaggtgcatcggcatcagcaacagggacttcgtggagggcgtgagcggcgccacct Round II gggtggacgtggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcc shuffled caccctggacatcgagctgctgaagaccgaggtgaccaaccccgccaccctgaggaag clone no tactgcatcgaggccaagatcaccaacatcaccaccgccaccaggtgccccacccagg PRM gcgaggccatcctgcccgaggagcaggaccagaactacgtgtgcaagcacacctacgt ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcctggtgacctgc gccaagttccagtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctga agtacaccgtgatcatcaccgtgcacaccggcgaccagcaccaggtgggcaacgacac ccagggcgtgaccgtggagatcaccccccaggccagcaccgtggaggccatcctgccc gagtacggcaccctgggcctggagtgcagccccaggactggcctggacttcaacgaga tgatcctgctgaccatgaagaagaagacctggatggtgcacaggcagtggttcttcga cctgcccctgccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaag gagaggatggtgaccttcaagaccgcccacgccaagaggcaggacgtgaccgtgctgg gcagccaggagggcgccatgcacagcgccctgaccggcgccaccgaggtggacagcgg cgacggcaacctgctgttcaccggccacctgaagtgcaggctgaagatggacaagctg cagctgaagggcatgagctacaccatgtgcaccggcaagttccagatcgtgaaggaga tcgccgagacccagcacggcaccatcctgatcaaggtggagtacaagggcgaggacgc cccctgcaagatccccttcagcaccgaggacggccagggcaaggcccacaacggcagg ctgatcaccgccaaccccgcggtgaccgacaaggagaagcccgtgaacatcgagctgg agccccccttcggcgacagctacatcgtggtgggcgccggcgacaaggccctgaagct gagctggttcaagaagggcagcagcatcggccagatgttcgagagcacctacaggggc gccaagaggatggccatcctgggcgagaccgcctgggacctgggcagcatcggcggcg tgttcaccagcgtgggcaag 324. 17H10 atgaggtgcatcggcatcagcaacagggacttcgtggagggcgtgagcggcggcagct NPRM gggtggacgtggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcc Round II caccctggacatcgagctgctgaagaccgaggtgaccaaccccgccaccctgaggaag shuffled tactgcatcgaggccaagatcaccaacatcaccaccgacagcaggtgccccacccagg clone no gcgaggccatcatgcccgaggagcaggaccagaactacgtgtgcaagcacacctacgt PRM ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcctggtgacctgc gccaagttcacctgcaagaagaacatggagggcaaggtggtgcagcacgagaacctga agtacaccgtgatcatcaccgtgcacaccggcgaccagcaccaggtgggcaacgacac ccagggcgtgaccgtggagatcaccccccaggccagcaccgtggaggccatcctgccc gagtacggcaccctgggcctggagtgcagccccaggactggcctggacttcaacgaga tgatcctgctgaccatgaagaacaaggcctggatggtgcacaggcagtggttcctgga cctgcccctgccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaag gagaggatggtgaccttcaagaccgcccacgccaagaagcaggacgtggtggtgctgg gcagccaggagggcgccatgcacaccgccctgaccggcgccaccgaggtggacagcgg cgacggcaacctgctgttcaccggccacctgaagtgcaggctgaagatggacaagctg cagctgaagggcatgagctacgtgatgtgcaccggcaagttccagatcgtgaaggaga tcgccgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcgacggcgc cccctgcaagatccccttcagcaccgaggacggccagggcaaggcccacaacggcagg ctgatcaccgccaaccccatcgtgatcgacaaggagaagcccgtgaacatcgagctgg agccccccttcggcgacagctacatcgtggtgggcgccggcgacaaggccctgaagat caactggtacaagaagggcagcagcatcggcaagatgttcgaggccaccgccaggggc gccaggaggatggccatcctgggcgacaccgcctgggacttcggcagcgtgggcggcg tgctgaccagcctgggcaagatggtgcaccagatcttcggcagcgtgtacaccaccat gttcggc 325. 17H12- atgaggtgcgtgggcgtgggcaacagggacttcgtggagggcctgagcggcgccacct NPRM gggtggacgtggtgctggagcacggcagctgcgtgaccaccatggcccagggcaagcc Round II caccctggacatcgagctgctgaagaccgaggtgaccaaccccgccaccctgaggaag shuffled tactgcatcgaggccaagctgaccaataccaccaccgagagcaggtgccccacccagg clone no gcgagccctacctgaaggaggagcaggaccagcagtacatctgcaggagggacgtggt PRM ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgc gccaagttcaagtgcgtgaccaagctggagggcaagatcgtgcagtacgagaacctga agtacagcgtgatcgtgaccgtgcacaccggcgaccagcaccaggtgggcaacgagac caccgagcacggcaccatcgccaccatcaccccccaggcccccaccagcgagatccag ctgaccgactacggcgccctgaccctggactgcagccccaggaccggcctggacttca acagggtggtgctgctgaccatgaagaagaagagctggctggtgcacaagcagtggtt cctggacctgcccctgccctggaccgccggcgccgacaccagcgaggtgcactggaac cacaaggagaggatggtgaccttcaagaccgcccacgccaagaagcaggacgtggtgg tgctgggcagccaggagggcgccatgcacaccgccctgaccggcgccaccgaggtgga cagcggcgacggcaacctgctgttcaccggccacctgaagtgcaggctgaagatggac aagctgcagctgaagggcatgagctacagcatgtgcaccggcaagttccagatcgtga tgctgaccagcctgggcaaggccgtgcaccaggtgttcggcagcgtgtacaccaccat gttcggc 328. 18D7-NPRM atgaggtgcatcggcatcagcaacagggacttcgtggagggcgtgagcggcgccacct Round II gggtggacgtggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcc shuffled caccctggacatcgagctgctgaagaccgaggtgaccaaccccgccgtgctgaggaag clone no ctgtgcatcgaggccagcatcagcaacatcaccaccgccaccaggtgccccacccagg PRM gcgaggccatcctgcccgaggagcaggaccagaactacgtgtgcaagcacacctacgt ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcctgatcacctgc gccaagttcaagtgcgtgaccaagctggagggcaagatcgtgcagtacgagaacctga agtacagcgtgatcgtgaccgtgcacaccggcgaccagcaccaggtgggcaacgacac ccagggcgtgaccgtggagatcaccccccaggccagcaccgtggaggccatcctgccc gagtacggcaccctgggcctggagtgcagccccaggaccggcctggacttcaacgaga tgatcctgctgaccatgaagaacaaggcctggatggtgcacaggcagtggttcttcga cctgcccctgccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaag gagaggatggtgaccttcaagaccgcccacgccaagaagcaggacgtggtggtgctgg gcagccaggagggcgccatgcacaccgccctgaccggcgccaccgagatccagatgag cagcggcaccaccaccatcttcgccggccacctgaagtgcaggctgaagatggacaag ctgcagctgaagggcatgagctacagcatgtgcaccggcaagttccagatcgtgaagg agatcgccgagacccagcacggcaccatcctgatcaaggtgaagtacgagggcaccga cgccccctgcaagatccccttcagcagccaggacgagaagggcgtgacccagaacggc aggctgatcaccgccaaccccatcgtgaccgagaaggacagccccgtgaacatcgagg ccgagccccccttcggcgagagctacatcgtggtgggcgccggcgagaaggccctgaa gctgagctggttcaagaagggcagcagcatcggccagatgttcgagagcacctacagg ggcgccaagaggatggccatcctgggcgagaccgcctgggacttcggcagcgtgggcg gcctgctgaccagcctgggcaaggccgtgcaccaggtgttcggcagcgtgtacaccac catgttcggc 329. 18H2-NPRM atgaggtgcgtgggcgtgggcaacagggacttcgtggagggcgtgagcggcggcgcct Round II gggtggacctggtgctggagcacggcagctgcgtgaccaccatggccaagaacaagcc shuffled caccctggacatcgagctgctgaagaccgaggtgaccaaccccgccaccctgaggaag clone no tactgcatcgaggccaagatcaccaacatcaccaccgacagcaggtgccccacccagg PRM gcgaggccatcctgcccgaggagcaggaccagaactacgtgtgcaagcacacctacgt ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcgtggtgacctgc gccaagttccagtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctga agtacaccgtgatcatcaccgtgcacaccggcgaccagcaccaggtgggcaacgacac cagcaaccacggcgtgaccgtggagatcaccccccaggccagcaccgtggaggccatc ctgcccgagtacggcaccctgggcctggagtgcagccccaggaccggcctggacttca acagggtggtgctgctgaccatgaagaagaagagctggctggtgcacaagcagtggtt cctggacctgcccctgccctggaccgccggcgccgacaccagcgaggtgcactggaac cacaaggagaggatggtgaccttcaaggtgccccacgccaagaagcaggacgtggtgg tgctgggcagccaggagggcgccatgcacaccgccctggccggcgccaccgagatcca gatgagcagcggcaacctgctgttcaccggccacctgaagtgcaggctgaagatggac aagctgcagctgaagggcatgagctacagcatgtgcaccggcaagttccagatcgtga aggagatcgccgagacccagcacggcaccatcgtgatcagggtgcagtacgagggcga cggcgccccctgcaagatccccttcagcaccgaggacggccagggcaaggcccacaac ggcaggctgatcaccgccaaccccgtggtgaccaagaaggaggagcccgtgaacatcg aggccgagccccccttcggcgagagcaacatcgtgatcggcatcggcgacaaggccct gaagatcaactqgtacaagaagggcagcagcatcggcaagatgttcgagaccaccatg aggggcgccaagaggatggccatcttgggcgacaccgcctgggacttcggcagcgtgg gcggcctgctgaccagcctgggcaagatggtgcaccagatcttcggcagcgtgtacac caccatgttcggc 330. 18H6-NPRM atgaggtgcgtgggcgtgggcaacagggacttcgtggagggcctgagcggcgccacct Round II gggtggacgtggtgctggagcacggcggctgcgtgaccaccatggccaagaacaagcc shuffled caccctggacttcgagctgatcaagaccaccgccaaggaggtggccctgctgaggacc clone no tactgcatcgaggccagcatcagcaacatcaccaccgccaccaggtgccccacccagg PRM gcgaggccatcctgcccgaggagcaggaccagaactacgtgtgcaagcacacctacgt ggacaggggctggggcaacggctgcggcctgttcggcaagggcagcctggtgacctgc gccaagttcacctgcaagaagaacatggagggcaacatcgtgcagcccgagaacctgg agtacaccatcgtgatcaccccccacaccggcgaccagcaccaggtgggcaacgacac ccagggcgtgaccgtggagatcaccccccaggccagcaccgtggaggccatcctgccc gagtacggcaccctgggcctggagtgcagccccaggactggcctggacttcaacaggg tggtgctgctgaccatgaagaagaagagctggctggtgcacaagcagtggttcctgga cctgcccctgccctggaccgccggcgccgacaccagcgaggtgcactggaaccacaag gagaggatggtgaccttcaagaacgcccacgccaagaggcaggacgtgaccgtgctgg gcagccaggagggcgccatgcacagcgccctggccggcgccaccgagatccagatgag cagcggcaacctgctgttcaccggccacctgaagtgcaggctgaagatggacaagctg cagctgaagggcgtgagctacgtgatgtgcaccggcagcttcaagctggagaaggagg tggccgagacccagcacggcaccgtgctggtgcaggtgaagtacgagggcaccgacgc cccctgcaagatccccttcagcaccgaggacggccagggcaaggcccacaacggcagg ctgatcaccgtgaaccccatcgtgatcgacaaggagaagcccgtgaacatcgagctgg agccccccttcggcgagagctacatcgtggtgggcgccggcgagaaggccctgaagct gagctggttcaagaagggcagcagcatcggcaagatgttcgaggccaccgccaggggc gccaagaggatggccatcctgggcgagaccgcctgggacttcggcagcgtgggcggcc tgctgaccagcctgggcaaggccgtgcaccaggtgttcggcgccatctacggcgcc 331. pMaxVax acacatagcgccggcgctagctgagcaaaaggccagcaaaaggcca (“pMV” or “PMV”) primer 332. pMaxVax aactctgtgagacaacagtcataaatgtacagatatcagaccaagtttactcatatat primer 2 ac 333. pMaxVax ggcttctcacagagtggcgcgccgtgtctcaaaatctct primer 3 334. pMaxVax ttgctcagctagcgccggcgccgtcccgtcaagtcagcgt primer 4 335. pMaxVax agatctgtttaaaccgctgatcagcctcgactgtgccttc primer 5 336. pMaxVax acctctaaccactctgtgagaagccatagagcccaccgca primer 6 337. pMaxVax ggatccggtacctctagagaattcggcggccgcagatctgtttaaaccgctga primer 7 338. DEN-1 E- MRCVGTGNRDFVEGLSGATWVDVVLEHGSCVTTMAKIKPTLDIELLKTEVTNPAVLRK truncated LCIEAKISNTTTDSRCPTQGEATLVEEQDTNFVCRRTFVDRGWcNGCGLFGKGsLITC parent AKFKCVTKLEGKIVQYENLKYSVIVTTHTGDQHQVGNETTEHGIIAIITPQApTsEIQ (tE) LTDYGALTLDCSPRTGLDFNRVVLLTMKKKSWLVHKQWFLDLPLPWTSGASTSQETWN RQDLLVTFKTAHAKKQEVTVLGSQEGANHTALTGATEIQTSGTTIIFAGHLKCRLKMD KLTLKGVSYVMCTGSFKLEKEVAETQHGTVLVQVKYEGTDAPCKIPFSSQDEKGVTQN GRLITANPIVTDKEKPVNIEAEPPFGESYTVVGAGEKALKLSWFKKGSSTGKMFEATA RGARRMAILGDTAWDLGSIGGVPTSVGK 339. DEN-2 E- MRCIGISNRDFVEGVSGGSWVDIVLEHGSCVTTMAKNKPTLDFELIKTEATQPATLRK truncated YCIEAKLTNTTTESRCPTQGEPSLNEEQDKRFVCKHSMVDRGWGNGCGLFGKGGIVTC parent AMFTCKKNMEGNIVQPENLEYIIVTTPHSGEEHAVGNDTGKHGKEIKITPQSSITEAE (tE) LTGYGTVTMECSPRTGLDFNEIVLLQMEDKAWLVHRQWFLDLPLPWLPGADTQGSNRI QKETLVTFKNPHAKKQDVVVLGSQEGAMHTALTGATEIQMSSGNLLFTGHLKCRLRMD KLQLKGMSYSMCTGKFQIVKEIAETQHGIIVTRVQYEGDGSPCKTIPLEIMDLEKRVL GRLITVNPIVTEKDSPVNIEAEPPFGDSYIIIGVEPGQLKLHWPKKGSSIGQMFETTM RGAKPMAILGDTAWDFGSLGGVETSIGKALHQVFGAIYGA 340. DEN-3 E- MPCVGVGNRDFVEGLSGATWVDVVLEHGGCVTTNAKNKPTLDIELQKTEATQLATLRK truncated LCTEGKITNITTDSRCPTQGEAILPEEQDQNYVCKHTYVDRGWGNGCGLFGKGSLVTC parent AKFQCLEPTEGKVVQHENLKYTVIITVHTGDQHQVGNDTQGVTVEITPQASTVEAILP (tE) EYGTLCLECSPRTGLDFNEMILLTNKNKAWMVHRQWFFDLPLPWTSGATTETPTWNRK ELLVTFKNAHAKKQEVVVLGSQEGAIHTALTGTTETQNSGGTSIFAGHLKCRLKMDKL ELKGMSYAMCLNTFVLKKEVSETQHGTILIKVEYKGEDAPCKIPFSTEDGQGKAHNGR LITANPVVTKKEEPVNIEAEPPFGESNIVIGIGDKALKINWYKKGSSIGKMFEATARG ARRMAILGDTAWDFCSVCGVLNSLGKMVHQIFGSAYT 341. DEN-4 E- MRCVGVGNRDFVEGVSGGAWVDLVLEHGGCVTTMAQGKPTLDFELIKTTAKEVALLRT truncated YCIEASISNITTATRCPTQGEPYLKEEQDQQYICRRDVVDRGWGNGCGLFGKGGVVTC parent AKFSCSGKITGNLVQIENLEYTVVVTVHNGDTHAVGNDTSNHGVTATTTPrsPSVEVK (tE) LPDYGELTLDCEPRSGIDFNEMILMKNKKKTWLVIKQWFLDLPLPWTAGADTSEVHWN HKERVTFKVPHAKRQDVTVLGSQEGAHSALAGATEVDSGDCNHMFAGHLKCKVRIVIE KLRIKGMSYTMCSGKFSIDKEMAETQHGTTVVKVKYEGTGAPCKVPIEIRDVNKEKVV GRTTSSTPLAENTNSVTNIELEPPFGDSYIVIGVGDSALTLHWFRKGSSIGKMFESTY RGAKRMATLGETAWDFCSVGGLLTSLGKAVHQVFGSVYTTMFG 342. 30f10 atgagatctaccatcaccctgctctgcctgatccctactgttatggccttctctctgtcaacaa (c15/full gagatggcgagcctctcatgatcgtggccaagcacgaaagagggaggcctctgctgttcaagac prm/full tactgaagggatcaataagtgcactctcatcgccatggacctgggcgagatgtgtgaggatacc E) gtgacctacaagtgtccactgctggtcaacaccgaacccgaggatatcgattgctggtgcaatc tgacttctacttgggtgatgtatgggacctgtacccagtccggagagagaaggagggagaagag gtccgtcgccctggctcctcacgttggtatgggcctggacaccagaactcagacatggatgagc gctgagggagcttggaagcacgcccagagggttgaatcttggattctgcgcaatccacgctttg cactcctggctggcttcatggcttatatgattgggcagactggtattcagaggaccgtcttctt cgttctgatgatgctcgtggcaccatcttacggcatgaggtgcgtgggcgtgggcaacagggac ttcgtggagggcgtgagcggcggcgcctgggtggacgtggtgctggagcacggcagctgcgtga ccaccatggccaagaacaagcccaccctggacatcgagctgctgaagaccgaggtgaccaaccc cgccaccctgaggaagtactgcatcgaggccaagatcaccaacatcaccaccgacagcaggtgc cccacccagggcgaggccaccctggtggaggagcaggacaccaacttcgtgtgcaggaggacct tcgtggacaggggctggggcaacggctgcggcctgttcggcaagggcggcatcgtgacctgcgc caagttcaagtgcgtgaccaagctggagggcaacatcgtgcagcccgagaacctggagtacacc atcgtgatcaccccccacaccggcgaccagcaccaggtgggcaacgagaccaccgagcacggca ccatcgccaccatcaccccccaggcccccaccagcgagatccagctgaccgactacggcgccct gaccctggactgcagccccaggaccggcctggacttcaacagggtggtgctgctgaccatgaag aagaagacctggctggtgcacaagcagtggttcctggacctgcccctgccctggaccgccggcg ccagcaccagccaggagacctggaacaggaaggagctgctggtgaccttcaagaacgcccacgc caagaagcaggaggtggtggtgctgggcagccaggagggcgccatgcacaccgccctgaccggc accaccgagatccagaacagcggcggcaccctgctgttcaccggccacctgaagtgcaggctga ggatggacaagctgcagctgaagggcatgagctacagcatgtgcaccggcaagttccagatcgt gaaggagatcgccgagacccagcacggcaccatcgtgatcaaggtggagtacaagggcgaggac gccccctgcaagatccccttcagcaccgaggacgagaagggcgtgacccagaacggcaggctga tcaccgccaaccccatcgtgaccgagaaggacagccccgtgaacatcgaggccgagcccccctt cggcgagagctacatcgtggtgggcatcggcgacaaggccctgaagatcaactggtacaagaag ggcagcagcatcggcaagatgttcgaggccaccgccaggggcgccaggaggatggccatcctgg gcgagaccgcctgggacttcggcagcgtgggcggcctgctgaccagcctgggcaaggccgtgca ccaggtgttcggcagcgtgtacaccacaatgtttggcggcgtctcttggatggtgagaatcctg atcgggttcctcgtcctgtggattggaaccaatagcaggaatacaagcatggccatgagccgca tcgctgttggcggcatcacactcttcctgggtttcaccgttcacgca 343. 30F10 MRSTITLLCLIPTVMAFSLSTRDGEPLMIVAKHERGRPLLFKTTEGINKCTLIAMDLGEM (C15/full CEDTVTYKCPLLVNTEPEDIDCWCNLTSTWVMYGTCTQSGERRREKRSVALAPHVGM prm/full GLDTRTQTWMSAEGAWKHAQRVESWILRNPRFALLAGFMAYMAGQTGIQRTVFFVL E) MMLVAPSYGMRCVGVGNRDFVEGVSGGAWVDVVLEHGSCVTMAKNKPTHDIELLK TEVTNPATLRKYCIEAKITNIITDSRCPTQGEATHVEEQDTNFVCRRTFVDRGWGNGCG LFGKGGIVTCAKFKCVTKLEGNIVQPENLEYIIVITPHTGDQHQVGNETTHHGIIAIITP QAPTSEIQLTDYGALTLDCSPRTGLDFNRVVLLTMKKKTWLVHKQWFLDLPLPWTAG ASTSQETWNRKELLVTFKNAHAKKQEVVVLGSQEGAMHTALTGIITEIQNSGGTLLFT GHLKCRLRMDKLQLKGMSYSMCTGKFQIVKEIAETQHGIIVIKVEYKGEDAPCKIPFST EDEKGVTQNGRLITANPIVTEKDSPVNIEAEPPFGESYIVVGIGDKALKINWYKKGSSIG KMTEATARGARRMAILGETAWDFGSVGGLLTSLGKAVHQVFGSVYTTMFGGVSWMV RJLIGFLVLWIGTNSRNTSMAMSCIAVGGITLFLGFTVHA 344. 30G2 atgagatctgtgaccatgattctcatgctgctgcctactgctctggccttccatctgacaacaa (c15/full gagatggcgagcctaggatgatcgtgggcaagaacgaaagagggaagagcctgctgttcaagac prm/full tgcttcagggatcaatatgtgcactctcatcgccatggacctgggcgagatgtgtgaggatacc E) atgacctacaagtgtccacggatgaccgaggccgaacccgacgatgtggattgctggtgcaatg caactgatacttgggtgacctatgggacctgtagccagacaggcgagcataggagggataagag gtccgtcgccctggaccctcacgttggtctgggcctggaaaccagaaccgagacatggatgagc tctgagggagcttggaagcacgcccagagggttgaatcttggattctgcgcaatccacgctttg cactcctggctggcttcatggcttatatgattgggcagactggtattcagaggaccgtcatctt cattctgctcatgctcgtgacaccatctatggccatgaggtgcgtgggcatcggcaaccgcgac ttcgtggagggcctgagcggcgccacctgggtggacgtggtgctggagcacggcagctgcgtga ccaccatggccaagaacaagcccaccctggacatcgagctgctgaagaccgaggtgaccaaccc cgccgtgctgaggaagctgtgcatcgaggccaagatcagcaacaccaccaccgacagcaggtgc cccacccagggcgaggccatcctgcccgaggagcaggaccagaactacgtgtgcaagcacacct acgtggacaggggctggggcaacggctgcggcctgttcggcaagggcagcctggtgacctgcgc caagttccagtgcctggagcccatcgagggcaaggtggtgcagcacgagaacctgaagtacacc gtgatcatcaccgtgcacaccggcgaccagcaccaggtgggcaacgagaccaccgagcacggca ccatcgccaccatcaccccccaggcccccaccagcgagatccagctgaccgactacggcgccct gaccctggactgcagccccaggaccggcctggacttcaacagggtggtgctgctgaccatgaag aagaagacctggctggtgcacaagcagtggttcctggacctgcccctgccctggaccgccggcg ccagcaccagccaggagacctggaacaggaaggagctgctggtgaccttcaagaacgcccacgc caagaagcaggaggtggtggtgctgggcagccaggagggcgccatgcacaccgccctgaccggc gccaccgaggtggacagcggcgacggcaacctgctgttcaccggccacctgaagtgcaggctga ggatggacaagctgcagctgaagggcatgagctacagcatgtgcaccggcaagttccagatcgt gaaggagatcgccgagacccagcacggcaccatcgtgatcagggtgaagtacgagggcaccgac gccccctgcaagatccccttcagcagccaggacgagaagggcgtgacccagaacggcaggctga tcaccgccaaccccatcgtgatcgacaaggagaagcccgtgaacatcgaggccgagcccccctt tggcgacagctacatcatcatcggcgtggagcccggccagctgaagctgcactggttcaagaag ggcagcagcatcggccagatgttcgagaccaccatgaggggcgccaagaggatggccatcctgg gcgacaccgcctgggacttcggcagcctgggcggcgtgttcaccagcatcggcaaggccctgca ccaggttttcggtgcaatctatggcgtgctgtttttcggcgtctcttggaccatgaagatcggt attggcatcctcctcacatggctgggactgaatagcaagaatacaagcatgagctttagctgca tcgctattggcatcatcacactctacctgggtgtggtggttcaggca 345. 30G2 MRSVTMILMLLPTALAFHUFfRDGEPRMIVGKNERGKSLLFKTASGINMcTLIAMDLG (C15/full EMCEDTMTYKCPRMTEAEPDDVDCWCNATDTWVTYGTCSQTGEHRRDKRSVAIDPH prm/full VGLGLETRTETWMSSEGAWKHAQRVESWILRNPRFALLAGFMAYMIGQTGIQRTVIFI E) LLMLVTPSMAMRCVGIGNRDEVEGLSGATWVDVVLEHGSCVTfMAKNKPTHDIELLK TEVTNPAVLRKLCIEAKISNTTDSRCPTQGEAILPEEQDQNYVCKJITYVDRGWGNGC GLFGKGSLVTCAKFQCLEPIEGKVVQHENLKYTVHTVHTGDQHQVGNEPTEHGIIAII TPQAPTSEIQLTDYGALTLDCSPRTGLDFNRVVLLTMKKKTWLVHKQWELDLPLPWT AGASTSQETWNRKELLVTFKNAHAKKQEVVVLGSQEGAMHTALTGATEVDSGDGNL LFRGHLKCRLRMDKLQLKGMSYSMCTGKFQIVKEIAETQHGIIVMVKYEGTDAPCKIP FSSQDEKGVTQNGRLITANVIDKEKPVNLEAEPPFGDSYIIIGVEPGQLKLHWFKKGSS IGQMFERTMRGAKRMAILGDTAWDFGSLGGVFTSIGKALHQVFGAIYGVLFFGVSWT MKIGIGILLTWLGLNSKNTSMSFSCIAIGIITLYLGVVVQA

While the foregoing invention has been described in some detail for purposes of clarity and understanding, it will be clear to one skilled in the art from a reading of this disclosure that various changes in form and detail can be made without departing from the true scope of the invention. It is understood that the examples and embodiments described herein are for illustrative purposes only and that various modifications or changes in light thereof will be suggested to persons skilled in the art and are to be included within the spirit and purview of this application and scope of the appended claims. For example, all the techniques and apparatus described above may be used in various combinations.

All references, including publications, patent applications, patents, and/or other documents cited herein, including those not specifically indicated as being incorporated by reference when cited above, are each hereby incorporated by reference in their entirety for all purposes to the same extent as if each such reference were individually and specifically indicated to be incorporated by reference herein in its entirety for all purposes and were set forth in its entirety herein.

The use of the terms “a” and “an” and “the” and similar referents in the context of describing the invention (especially in the context of the following claims) are to be construed to cover both the singular and the plural, unless otherwise indicated herein or clearly contradicted by context. Terms such as “including,” “having,” “comprising,” “containing,” and the like are to be construed as open-ended terms (i.e., meaning “including, but not limited to”) unless otherwise indicated, and as encompassing the phrases “consisting of” and “consisting essentially of.” Recitation of ranges of values herein are merely intended to serve as a shorthand method of referring individually to each separate value of the range, unless otherwise indicated herein, and each separate value is incorporated into the specification as if it were individually recited herein. All methods described herein can be performed in any suitable order unless otherwise indicated herein or otherwise clearly contradicted by context. The use of any and all examples, or exemplary language (e.g., “such as”) provided herein, is intended merely to better illuminate the invention and does not pose a limitation on the scope of the invention unless otherwise claimed. No language in the specification should be construed as indicating any non-claimed element as essential to the practice of the invention. The headings provided in the description of the invention are included merely for convenience and are not intended to be limiting in the scope of the disclosure.

The citation of any patent or patent document herein does not reflect any view concerning the patentability of the subject matter described or claimed in such patent documents. Rather, such patent documents may be cited merely to provide convenient reference for suitable techniques and compositions, including techniques and compositions otherwise well known in the art.

All amino acid or nucleotide sequences of one of the aforementioned sequence patterns are to be considered individually disclosed herein. Thus, for example, an amino acid sequence pattern of three residues, where a “Xaa” represents one of the amino acid positions in the pattern represents a disclosure of twenty different amino acid sequences (i.e., one sequence for each naturally occurring amino acid residue that could be present in the Xaa position).

Any of the techniques and any of the characteristics of the viral vector particle compositions of the invention can be combined in any suitable manner, unless otherwise stated or clearly contradicted by context.

Preferred embodiments and aspects of this invention are described herein, including the best mode known to the inventors for carrying out the invention. Variations of those preferred embodiments may become apparent to those of ordinary skill in the art upon reading the foregoing description. The inventors expect skilled artisans to employ such variations as appropriate, and the inventors intend for the invention to be practiced otherwise than as specifically described herein. Accordingly, this invention includes all modifications and equivalents of the subject matter recited in the claims appended hereto as permitted by applicable law. Moreover, any combination of the above-described elements in all possible variations thereof is encompassed by the invention unless otherwise indicated herein or otherwise clearly contradicted by context. 

1. A recombinant or synthetic polypeptide comprising an amino acid sequence that has at least 95 percent amino acid sequence identity to the full-length sequence set forth in SEQ ID NO:239.
 2. The recombinant or synthetic polypeptide of claim 1, wherein the recombinant or synthetic polypeptide induces an immune response in a subject against at least one flavivirus.
 3. The recombinant or synthetic polypeptide of claim 1, wherein the recombinant or synthetic polypeptide induces an immune response in a subject against at least one dengue virus.
 4. The recombinant or synthetic polypeptide of claim 1, wherein the recombinant or synthetic polypeptide induces an immune response in a mammal against dengue viruses of multiple serotypes.
 5. The recombinant or synthetic polypeptide of claim 1, wherein the recombinant or synthetic polypeptide induces an immune response in a mammal against at least one dengue virus of at least one serotype selected from the group of dengue-1, dengue-2, dengue-3, and dengue-4 that is about equal to or greater than an immune response induced in the mammal against the at least one dengue virus of the at least one serotype by a dengue virus antigen selected from the group of SEQ ID NOS:227-230.
 6. The recombinant or synthetic polypeptide of claim 1, wherein the recombinant or synthetic polypeptide comprises the full-length amino acid sequence set forth in SEQ ID NO:239.
 7. The recombinant or synthetic polypeptide of claim 1, wherein the recombinant or synthetic polypeptide induces production of antibodies capable of binding to a first dengue virus of a first serotype and a second dengue virus of a second serotype, wherein the first serotype is different from the second serotype.
 8. A composition comprising at least one recombinant or synthetic polypeptide and an excipient or carrier, wherein said polypeptide comprises an amino acid sequence that has at least 95 percent amino acid sequence identity to the full-length sequence set forth in SEQ ID NO:239.
 9. A virus-like particle comprising at least one polypeptide, wherein said polypeptide comprises an amino acid sequence that has at least 95 percent amino acid sequence identity to the full-length sequence set forth in SEQ ID NO:239.
 10. A method of producing antibodies against at least one dengue virus of at least one serotype comprising administering to a subject at least one recombinant or synthetic polypeptide in an amount sufficient to induce antibody production, wherein said polypeptide comprises an amino acid sequence that has at least 95 percent amino acid sequence identity to the full-length sequence set forth in SEQ ID NO:239.
 11. The method of claim 10, wherein a population of neutralizing antibodies against at least one dengue virus selected of at least one serotype selected from the group of dengue-1, dengue-2, dengue-3, and dengue-4 is induced.
 12. A recombinant or synthetic polypeptide comprising an amino acid sequence that has at least 95 percent amino acid sequence identity to the full-length amino acid sequence set forth in SEQ ID NO:239, wherein the amino acid sequence of said polypeptide does not include the first 16 amino acid residues of SEQ ID NO:239, and wherein the recombinant or synthetic polypeptide induces an immune response in a subject against at least one dengue virus. 